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Preface 


This book began during a visit of the first author to the second in Calgary 
in July of 2013. We had both considered separately the possibility of writing 
a book about the Lucas sequences, but during this visit, we agreed to collab- 
orate on such a project; we also agreed that we were far too busy to begin 
until several years into the future. The next occasion when we were able to 
meet was the 18th Conference of the Fibonacci Association in July, 2018, at 
Halifax, Canada. It was during this meeting that work on this book began in 
earnest. While we had hoped to complete this undertaking in about 3 years, 
many obstacles to our progress, not the least of which was the COVID-19 
pandemic, prevented its completion until now. 


We explain the rationale for this book in the introductory chapter, so 
we will only mention here that it is the result of a blending of the diverse 
interests of the authors in various aspects of the Lucas sequences; as such it 
is a somewhat unusual volume, but we hope it will provide its readers with 
a good introduction and a sense of the direction of research in this topic. 
For those readers who wish to delve more deeply into the subject matter, 
we have attempted to provide a thorough bibliography at the end of each 
chapter. Unfortunately, it is very possible that we, through inadvertence or 
ignorance, may not have cited some important contributions; we can only 
offer our apologies to our readers and the affected authors. 


We both have a great interest in the history of mathematics and hope 
that our readers will forgive our frequent excursions into this area. For those 
who also appreciate history, we have included an appendix which discusses 
Edouard Lucas, his times and his work. If he is known at all today, it is 
usually through his contributions to recreational mathematics, but we hope 
that this production will illustrate his achievements in other mathematical 
endeavors, particularly computational number theory. 


The origin of Prof. Williams’ interest in the Lucas sequences developed 
from a course in number theory given by Dr. T. M. K. Davison at the Uni- 
versity of Waterloo in 1966, while that of Prof. Ballot came from his thesis 


vii 
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work (1990-92) which originated from the study of the 1985 paper of J. 
Lagarias on the prime divisors of the Lucas numbers. Thus, we have each 
devoted a substantial portion of our academic careers to their study, an ac- 
tivity which has brought us enormous intellectual and personal satisfaction. 
It is our hope that the readers of this work will share in our delight in the 
wonderful properties of these remarkable mathematical objects. 


Caen, France Christian J.-C. Ballot 
Ottawa, ON, Canada Hugh C. Williams 
January, 2023 
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Chapter 1 
Introduction 


The secret of getting ahead is getting started. (Mark Twain) 


In his youth Albert Einstein spent a year loafing aimlessly. You don’t get anywhere 
by not ‘wasting time.’ (Carlo Rovelli) 


As this is a book about number sequences, we begin by defining a number 
sequence to be an ordered set of numbers. We will often use the notation 
(X,,) or sometimes simply X to denote the sequence Xo, X1, X2,...,Xn,--- 
This will be convenient because we will be dealing with recurrence se- 
quences whose elements are related to the previous elements in a clear-cut 
way. To define such a sequence, we require a rule, called a recurrence re- 
lation, to construct each element in terms of the ones before it. Further- 
more, enough initial elements must be given so that all subsequent ele- 
ments of the sequence can be computed by successive applications of the 
recurrence relation. For example, consider the Fibonacci numbers. These 
numbers are defined by the initial elements Fo = 0 and Fi = 1 and the 


recursion formula F,4; = Fy, + Fn-i, for n > 1. The sequence is thus 
0,1,1,2,3,5, 8,13, 21,... However, since F,-1 = Fn41 — Fn, we can compute 
F_y = -1,F_. = 1,F_3 = —2,...,F_» = (—1)"Fy. Thus, the complete 


Fibonacci sequence is 


2, 18, 5, 9; 1 1, 1 1 8G, B18 Bc 


Since many of the sequences that we will discuss can be extended in this way, 
we will occasionally use the notation (X,,) to represent the full sequence 


ey te ee ae Coe Cee Cee Cees 


It should be clear from the context which sequence is meant. 
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2 1 Introduction 


Perhaps as early as 1202, the Fibonacci numbers were presented by 
Leonardo Bonacci in his Liber Abbaci (Book of Calculation)’, as the solu- 
tion of a problem of computing the number of rabbits that would result at 
time n under a particular breeding model. Certainly, we know that they ap- 
peared in the 1228 edition of this book. Leonardo (c. 1170-c. 1240-1250), 
a thirteenth-century Pisan mathematician, was known by other names such 
as Leonardo Pisano and Leonardo Bigollo (Pisano), but he is best known 
today as Fibonacci, a contraction of filius Bonacci (son of Bonacci), invented 
by the French historian Guillaume Libri in 1838. (See Devlin [5] for more 
details concerning what little is known about Fibonacci.) We emphasize here 
that Fibonacci was not the first to discuss these numbers; they were known 
much earlier to Indian mathematicians. (See Singh [25].) Because of their 
intrinsic interest and their many applications, the Fibonacci numbers have 
been the object of a considerable body of research; indeed, an entire journal, 
the Fibonacci Quarterly, established in 1963, is devoted to the study of their 
many properties and those of recurrence sequences in general. In addition, 
we mention the books of Gilson [8], Koshy [11], Posamentier and Lehman 
[22], and Vajda [27], each of which is dedicated to the study of the Fibonacci 
and Lucas numbers. The sequence of Lucas numbers (Lp) satisfies the same 
recurrence relation as the Fibonacci numbers, but with initial terms Do = 2 
and L, = 1.4 The sequences (F;,) and (Z,) possess a variety of properties, 
many of which can be found in the encyclopedic [11]. We mention here that 
(L,) has even found an application in dynamical systems. See Puri and Ward 
[23], where it is shown that the only recurrence sequence satisfying the Fi- 
bonacci recurrence and realizable as the number of periodic points of a map 
must be a multiple of (L,). 


A very notable feature of the Fibonacci numbers is the number of algebraic 
identities that they satisfy. For example, consider the identity discovered in 
1680 by G. D. Cassini [11, pp. 85-86, 106, 114]: 


FR ~ Fn4aFr—1 = (-1)"™; 
this is a special case of the identity of Catalan 
Fr — FntmFn—m = (-1)"" Fin, 
which in turn is a special case of Vajda’s [27, (20a)] identity: 
FrFnti-j — FrtiPn—j = (-1)" 9 RF. 


A yet more general and more symmetric form can be found in Johnson [10]: 


+ The Liber Abbaci was first published in 1202, but no copies of this version of the book 
have survived. Several copies of the second 1228 edition, however, are extant. (See [5].) 

t The term “Lucas numbers” for the numbers 2, 1,3, 4,7, 11,... seems to have become more 
and more common in the mid-1960s in papers published in the Fibonacci Quarterly. 
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as long asa+b=c+d. Puta=n,b=n+i-j,c=n+i,d=n-— J, and 
r =n-—j to obtain Vajda’s identity. As can be seen in [11], there are many, 
many more identities involving the (F,,) and (L,,) sequences. 


By computing a table of the Fibonacci numbers and their corresponding 
prime divisors, Lucas was able to discover empirically several characteristics 
of these factors [28, Sect. 3.1], which he was later able to prove in [16]. Indeed, 
it is now known that the Fibonacci numbers possess a large number and 
variety of fascinating number-theoretic properties. 


In 1844, Gabriel Lamé [12], in his quest to bound the number of divisions 
needed in executing the Euclidean algorithm for determining the greatest 
common divisor of two integers, rediscovered the Fibonacci numbers. Up to 
1876, the Fibonacci sequence was known to Lucas as the Lamé sequence (or 
the Lamé series). In January of 1876, Lucas [14] published a most remarkable 
short paper (actually, an extended abstract), in which he determined the 
possible linear forms of primes which divide certain elements of the Lamé 
(Fibonacci) sequence, the Lucas numbers, originally defined as F2,,/F;, and 
denoted now by L,,. He mentioned the identities Dy+, = Dp, + Ln, and 
Lyn = L? — 2(-1)" and concluded the paper with the following (at that 
time) astonishing statement: 

Furthermore, it is important to remark that Theorems X and XI allow us to de- 

termine whether a number is prime or composite without making use of a table of 

prime numbers. It is with the aid of these theorems that I think I have proved that 
the number A = 2!?7 — 1 is prime. This number consists of thirty-nine digits, while 
the largest prime number currently known consists of only ten digits. This number 

is, according to Euler, equal to 2%! — 1. 

The aforementioned Theorems X and XI are stated without proof in the 
paper. Also, the largest prime known in 1876, but not to Lucas, was in fact 
a number of 14 digits (see Chap. 2 of Williams [28]), but the statement that 
he could prove A to be a prime was quite extraordinary because at the time, 
the number was considered to be very large. Certainly, proving A a prime by 
trial division would have been out of the question; indeed, it remained the 
largest known prime for another 75 years. 


In May of 1876, Lucas [15] introduced the sequences that would later bear 
his name. He let P and @ be coprime integers and a and £ be the roots 
of the quadratic polynomial f(x) = 2? — Px + Q with 6 = a — 6. He then 
defined what are now called the Lucas sequences (U;,) and (V,,) by using the 
functions 


Un = (a” — B)/6, Va =a" + B”. 


Since both U,, and V,, are symmetric functions of the roots of a polynomial 
with integer coefficients, they must both be integers for all nonnegative in- 
tegral values of n. He stated that when P = 1 and Q = —1, then U,, and 
Vp, are, respectively, the corresponding Fibonacci and the Lucas numbers. He 
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also listed a number of identities, including a generalization of the Cassini 
identity: 
Ue _ Un41Un-1 = 2 an 


It was in this paper that he pointed out that the sequence of Lamé had 
been discovered much earlier by Fibonacci. It has been called the Fibonacci 
sequence ever since, in spite of the priority of the Indian mathematicians of 
whose work Lucas was completely unaware. 


On reading [15], Genocchi [7] pointed out that Lucas’ functions were very 
similar to a pair of functions that he [6] had introduced several years earlier. 
Genocchi had anticipated several of Lucas’ findings and mentioned that ver- 
sions of his functions were known to Lagrange and Legendre. Nevertheless, 
because of the enormous number and variety of results involving (U;,) and 
(V;) that Lucas discovered from 1876 until about 1880, they are now named 
after him. In fact, it was during this period that he applied them in develop- 
ing tests for the primality of large integers, including what is now called the 
Lucas-Lehmer test for the primality of Mersenne numbers. For the most part, 
these tests were sufficiency tests, which, if they were satisfied, could establish 
that a number N of a certain special form is a prime. As Lucas well realized, 
these tests were quite novel for their time, because instead of having to trial 
divide N by a large number of integers, for example, all the primes less than 
VN, it was only necessary to compute some integer S and test whether N 
divides S. 


As it seems that Lucas was unaware of the previous work of Genocchi, La- 
grange, and Legendre, it is of some interest to investigate where the idea of his 
functions arose. He does not really address this in [15], instead emphasizing 
the similarity between his functions and the trigonometric circular functions 
(see Sect. 4.1). As early as 1872, undoubtedly as a result of his examination 
of the works of Fermat, he had sharpened his interest in Diophantine prob- 
lems. (See Lucas [13, 16, 18], where the problems in [13] are dealt with more 
extensively.) He was almost certainly aware at this time of “Fermat’s last 
theorem” (FLT). Indeed, much later (1891) in [20], he states: 


We had hoped to find in this study [of the Lucas sequences], through the prime 
decomposition of the expressions (a” +b”), a demonstration of Fermat’s last theorem 
concerning the impossibility of solving in integers the indeterminate equation 


cP +yP + 2? =0 


in which it suffices to assume that p denotes a prime. 


Thus, it appears that attempting to prove FLT was a motivating force be- 
hind his invention of the Lucas sequences and the development of their theory, 
but so also was his application of them in primality testing (see Sect. 3.1). 
Additional information concerning this matter can also be found in Sect. 6.1. 
It seems remarkable that Lucas did not mention anything about this in print 
until near the end of his truncated life (see the Appendix), but possibly he 
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was concerned about not revealing too much information until he had solved 
the problem or saving himself the embarrassment of failing to do so. In con- 
nection with this, it is a great pity that his early death prevented him from 
completing, among other projects, his proposed commentary on the works of 
Fermat, which he mentions on page vi of [19] and on the last page of [18]. 

Although much has been written about the Fibonacci and Lucas numbers, 
no monograph devoted to the properties of the Lucas sequences has ever 
appeared in print. This seems remarkable in view of the fact that almost all 
results concerning the Fibonacci and Lucas numbers have Lucas sequence 
analogues. There are, however, a few books, besides elementary textbooks, 
which include substantial sections dedicated to these sequences, particularly 
as they apply to primality testing. Some of these are Bressoud and Wagon 
[3], Crandall and Pomerance [4], Ribenboim [24], and Williams [28]. 


The objective of this book is to provide a much more thorough discussion 
of the Lucas sequences than is available in existing monographs. We will bring 
together a variety of results, which are currently scattered throughout the lit- 
erature. Various sections will be devoted to intrinsic arithmetic properties of 
these sequences, primality testing, density problems, and the problem of gen- 
eralizing them. Furthermore, their application, not only to primality testing 
but to integer factoring, solution of quadratic and cubic congruences, cryp- 
tography, and Diophantine equations, will be briefly discussed. Throughout 
the book, we will include a sprinkling of historical comments, where relevant. 


Much of the book is not intended to be overly detailed. Rather, our ob- 
jective is to provide a good, elementary, and clear explanation of the subject 
matter without too much ancillary material. Most chapters, with the excep- 
tion of the second and the fourth, will address a particular theme, provide 
enough information for the reader to get a feel for the subject, and supply ref- 
erences to more comprehensive results. We have also attempted to make most 
of this work accessible to anyone with a basic knowledge of elementary num- 
ber theory and abstract algebra. Our intended audience is number theorists, 
both professional and amateur, students, and enthusiasts. We emphasize that 
this book was never intended to be a textbook; its focus is either much too 
narrow or too broad for that, but it might be used as supplementary reading 
for students enrolled in second or more advanced courses in number theory. 


Lucas sequences are very simple mathematical objects. They are sequences 
of integers, where each term of the (U;,) or the (V,,) sequence is a fixed 
weighted sum of the two previous terms. That is, for all n, we have 


Xn+2 = PXp41 _ QXn,; 


where X is a U or a V Lucas sequence. One would naturally expect them to 
have mainly additive properties. However, many of their key features arise 
from their divisibility properties. Several chapters of this book, in one manner 
or another, show or exploit themes rooted in the multiplicative nature of these 
sequences. For example, U,, is a multiple of U, whenever m is a multiple of 
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n, if m and n are positive integers. We saw that the celebrated Fibonacci 
sequence and its companion sequence, the sequence of Lucas numbers, form 
one instance of a pair of Lucas’ functions for P = 1 and Q = —1. But putting 
P=2 and Q = 1 yields U, = n and V, = 2 for all n. This observation, 
i.e., the fact that the sequence of natural numbers is a Lucas sequence, has 
been an abundant source of discovery of properties of the Lucas sequences and 
probably still holds many more future findings. We may call this phenomenon 
the n > U,, theme. For instance, the simple remarkable identity 2n + 1 = 
(n + 1)? — n?, which has the Fibonacci cousin identity Fon41 = F2,, + F?, 
generalizes to U2n41 = U2,,-—QU?, for an arbitrary Lucas sequence U(P, Q). 
The trivial identity (2n) = 2-n generalizes to U2, = V,,- Un for a generic 
pair (U,V) of Lucas sequences. In the 1860s, Wolstenholme discovered that 
given a prime p > 5, if we express the rational number ae 1/n, in lowest 
terms as A/B, A and B natural numbers, then A is a multiple of p?. Clearly, 
the sum ee 2/n has the same property and is a particular instance of the 
more general sum SS V,/Un, where the rank p of the prime p in the Lucas 
sequence (U,,) is the least positive index t¢ for which p | U;. Remarkably, if 
pis p or p+1, then the numerator of the rational number Pyaar Vi/Un is 
a multiple of p?, (p > 5). We illustrate the n + U,, theme with two more 
examples. 


Two (disjoint) products of consecutive positive integers (containing each at 
least two terms) are never equal except in one instance. At least one direction 
of the statement is easily checked. Indeed, 


14-15=5-6-7, (1.1) 


and both products are 210. Thus, one is led to investigate the corresponding 
question in the context of a general Lucas sequence U. Given two nonzero 
integers P and Q and U = U(P,Q), solve the Diophantine equation 


UmUm+1 mea Um+s = UnUn41 ane Un+t; (1.2) 


in positive integers m, n, s, and t, with s and ¢t at least two. Corollary 1.3 
and Table 1 p. 293 of [26] tell us, among other things, that no other solutions 
to (1.2) exist, besides (1.1). 

An integer sequence X has the Pillai property if there is a (minimal) bound 
bx > 2such that for all k > bx, the sequence X has k consecutive terms such 
that none is prime to the product of the k—1 others. The sequence of natural 
numbers I, (J, = 7), is a Pillai sequence with by; = 17. (See [2, 21].) Hajdu 
and Szikszai proved [9, Thm. 1] that every nondegenerate Lucas sequence 
U(P,Q), i-e., such that U, 4 0 ifn > 1, satisfying gcd(P, Q) = 1, is a Pillai 
sequence. For most parameters P and Q, by is again equal to 17, but there 
are two infinite families of parameters (P,Q) with, respectively, by = 25 and 
by = 43. For the Fibonacci sequence, be = 25. 
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The two foregoing topics, and a number of other results concerning Lucas 
sequences, are not treated in this book. This is to say that, although this 
book gathers together the basic theory of Lucas sequences and certain special 
topics, applied and theoretical, that are dear to the authors, it was not our 
intention to give an exhaustive account of the subject, much of which can be 
found in number theory journals, and in particular, but not exclusively, in 
Integers, the Fibonacci Quarterly, and the Journal of Integer Sequences, all 
of which provide a free online access to their papers, with a modest current 
Fibonacci fee (i.e., $55 or $89) for access to the last 5 years of the Fibonacci 
Quarterly. 


We begin, in the next chapter, by providing a thorough and rigorous ac- 
count of the basic identity and arithmetic features of the Lucas sequences 
summarized in Lucas’ memoir [17]. We will not, however, record a great 
many of the large number of identity properties involving (U,,) and (V,,) that 
Lucas discovered—only the ones that we found useful in the development of 
our theme. For those who are interested in identities, we refer the reader to 
[17] and Chap. 4 of [28]. Many more identities can be found in various articles 
of the Fibonacci Quarterly, but also in the Journal of Integer Sequences or 
in Integers, and to a lesser degree in the Journal of Number Theory and the 
International Journal of Number Theory. For example, the first formula of 
[28, (4.2.4)] is the Lucas function version of Catalan’s identity: 

UZ - 


n 


Unim Un—m = orm. 


an identity which occurs as (32) in [17]. Also, [28, (4.2.14)] yields the Lucasian 
analogue of Vajda’s formula: 


UnUns+i-j _ On+iUn—-j = Q” Uj. 


A short proof of the Lucasian analogue of Johnson’s formula can be found in 
[1, Lemma 6]: 


UU, a UUa = Q"’(U4 rUp rr U. Ug #)> 


which holds provided a+b=c+d. 


Furthermore, we do not always insist that gcd(P,Q) = 1. Some of the 
results that we will mention, particularly the divisibility properties of V,,, 
seem not to have been known to Lucas, but it is difficult to know this for 
certain because he stated in [18, p. 44] that he had produced an additional 
20 sections for [17] which he had not yet published (and never did). 


In Chap. 3, we start with a discussion of the Mersenne numbers. We next 
describe several applications of the Lucas sequences to such problems as 
primality testing, solving certain congruences modulo a prime, and integer 
factorization. We then look at how these sequences have been used to deal 
with some particular Diophantine equations and with the problem of secure 
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communication. As in any of these applications, it is essential that we be able 
to compute remote terms in the Lucas sequences modulo some fixed number; 
we show how this can be done efficiently. 


Chapter 4 consists of several miscellaneous results concerning the Lucas 
sequences. These include the relationship of these sequences to the circular 
functions, the Chebyshev polynomials, and the Dickson polynomials. We also 
provide some further arithmetic properties of the Lucas sequences, briefly 
describe Lehmer’s extension of these sequences, and discuss the primitive 
divisor theorem. We conclude by considering the problem of finding those 
moduli for which (U;,) has a full set of residues. 


Chapter 5 is an in-depth examination of the Lucasnomials and their prop- 
erties. These objects, which can be considered as generalizations of the bi- 
nomial coefficients, owe their integrality to a remark made by Lucas, but 
beyond that, he did not provide results concerning them. 


Chapters 6, 7, and 8 are devoted to various aspects of Lucas’ problem of 
finding a generalization of his sequences. Several of Lucas’ ideas are discussed 
at some length, and some candidates as possible generalizations are provided. 


Every prime number not dividing Q divides some of the terms U,,. How- 
ever, if infinitely many primes divide terms of a given V sequence, infinitely 
many do not. Chapter 9 explains how starting with initial questions asked in- 
dependently by Sierpiriski for (2”-+1) and by Ward for (Z,,), Hasse, and later 
Lagarias came up with a remarkable method that gives the exact proportion 
of primes that divide a V sequence. For instance, on average, 17 out of 24 
primes divide some term of the form 2" +1. A generalization of the method 
of Hasse and Lagarias to computing the density of primes which divide s — 1 
consecutive terms for some specific linear recurrences of order s, (s > 2), is 
also explained. 


In Chap. 10, we provide a brief overview of the book and a selection of 
unsolved problems or open investigations. 


Finally, in the Appendix, we provide a short biography of Lucas’ life, where 
we mention some of his personality traits and discuss his legacy. 


Every chapter starts with some quotations. They should be viewed as 
condiments to the main text. We have limited ourselves to realms that may 
supposedly be of interest to our readership. Many of us teach and, thus, have 
to communicate effectively; many do research and many enjoy mathematics; 
others yet are students of mathematics. Consequently, these passages apply to 
mathematics, the sciences, research, creativity, the learning process, writing, 
telling, teaching, communication, perseverance, intuition, and even poetry. 
Partially in honor of Edouard Lucas, we have often elected not to translate 
quotations from French authors into English. 
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Chapter 2 
Basic Theory of the Lucas Sequences 


A mathematician is a pattern searcher. That’s all we do all day. 


(Marcus de Sautoy) 
Understanding is a kind of ecstasy. (Carl Sagan) 


Abstract This chapter discusses the fundamental properties of Lucas se- 
quences. Those who have worked with, or who have had an interest in, Lucas 
sequences will be aware of a good part of the material in the chapter, but we 
hope to bring about a degree of precision and completeness concerning these 
basic properties along with proofs. Regular, standard, degenerate, as well as 
general Lucas sequences are defined. Their most basic and important iden- 
tities are reviewed; divisibility properties and the p-adic valuation of their 
terms are studied for all primes p, whether regular or special. The notion of 
the rank and rank exponent of a prime, the Lucas laws of appearance and 
repetition, the lifting-the-exponent lemma, as well as the Euler criterion for 
Lucas sequences are encountered along the way. We did not seek the most 
concise exposition, but rather one that brings some insight and is strewn with 
remarks. In fact, we often give several proofs of the same result, providing 
elementary ones always, but occasionally also proofs that use basic algebraic 
number-theoretic concepts. Of course, these results are of use, in particular, 
in all chapters of the book. 


2.1 Definition 


The theory and properties of the Lucas sequences are scattered in various 
places in the literature and were already largely present in Lucas’ work 
[12, 13], although not expressed as clearly as by later authors (Carmichael 
[3], D. H. Lehmer [10], etc.) or by those who, here and there, taught classes 
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in number theory that included this subject. For instance, Raphael Robinson 
took classes from D. N. Lehmer at the University of California at Berkeley 
which included some theory of Lucas sequences. The theory of Lucas se- 
quences was taught by Robinson himself in the 1950s at the same university. 
John Brillhart, who attended Robinson’s lectures, taught such a class at the 
University of Arizona in the fall of 1993, which one of the authors of this book 
had the opportunity to attend. Part of the basic theory of Lucas sequences 
has also appeared elsewhere more recently: in two sections of the book of 
Hardy and Wright [7, Sect. 10.14; Sect. 15.5], in Ribenboim’s original paper 
[15], in Williams’ book [17, Chap. 4], and in Ballot’s paper [1]. The p-adic 
valuation of terms of the Lucas sequences treated here in Sects. 2.4, 2.5, 2.6, 
and 2.7 has appeared in [11] for the Fibonacci and Lucas numbers, in [1, 
Sect. 3.2.1], in [17, Chap. 4], and even more recently in [16]. But some mate- 
rial has either seldom or never appeared in published form yet, particularly 
in Sects. 2.9 and 2.12. 

We emphasize some notation used in the chapter. If a and 6 are rational 
integers, we say that a divides b and write a | b, provided there is a rational 
integer c such that b = ac. In particular, 0 | a@ implies a = 0. Given a 
rational prime p and an integer n, its p-adic valuation, v,(n), is the largest 
nonnegative integer v such that p” | n. Thus, p’t! { n. We put v,(0) = +00. 
We also write p” ||n. If m/n is a rational number, then its p-valuation is the 
valuation difference v,(m) — vp(n). Thus, v2(—24) = 3, v3(2/9) = —2, and 
v5(7/6) = 0. Within the rational integers Z, the relation m ~, n, or m~ n 
(base p), says that the integers m and n have the same p-adic valuation. 
Hence, 3 ~3 42, 2 ~3 1, and 75 ~ 25 (base 5). If n ~, 0, then n = 0. We 
make frequent use of the Iverson symbol [—], which is the Boolean function 
that assigns 1 to [P], if P is a true statement, and 0, if P is false. 


Given two nonzero rational integers P and Q, one defines a pair {U,V} of 
Lucas sequences. They both satisfy the same binary recursion 


Xue = Pag = OX, orn = 0, (2.1) 


One is called the fundamental Lucas sequence. The other is the companion, 
or the associate Lucas sequence. Following Lucas and tradition, U(P, Q), or 
more briefly the letter U, denotes the fundamental sequence, while V(P, Q), 
or V, denotes the companion sequence. The sequence U = (U,,)n>0 satisfies 
the initial conditions 

Up = 0, U, = 1. (2.2) 


The initial conditions that define the companion sequence are 
Vo=2,V,=P. (2.3) 


Note that all terms U,, and V,, n > 0, are integral. We often refer to U, = 
U,(P, Q) and V,, = V;,(P, Q) as the Lucas functions. In the interest of brevity, 
we will often use the symbols U,, and V,, without including the parameters 
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P and Q when it is understood from context what P and @ are or when we 
refer to U,(P,Q) and V,,(P, Q) for any P and Q. 

Lucas and, subsequently, many authors developed the theory with the hy- 
pothesis that the parameters P and Q are coprime. The Lucas sequences are 
said to be regular whenever gcd(P, Q) = 1. Although regular Lucas sequences 
possess stronger properties, we do not generally choose this approach because 
much of the Lucas theory remains valid, or “locally” valid for primes p not 
dividing gcd(P, Q), for nonregular sequences. In fact, the mathematical liter- 
ature contains many papers that prove results about regular Lucas sequences, 
which hold just as well for nonregular Lucas sequences. 

We let D = P? — 4Q denote the discriminant of the characteristic poly- 
nomial «? — Px + Q and a and £ denote its zeros.’ Thus, D = (a — 8)?, 
P=a+8, and Q =a. Throughout this work, we will write 6 for a— 8 so 
that 


a=—— and B=—_. (2.4) 
(The usual convention when D > 0 is to choose a > £.) 


If D £0, then the closed forms of U, and V,,, their so-called Binet form, 
symmetric in the two zeros, are 


ft en 
On GB 


Remark 1 Thus, the integrality of all U, terms, n > 0, generalizes the fact 
that for all rational integers x and y, x — y divides x” — y”. 


and = V,=a"+p". (2.5) 


Remark 2 One sees that the fundamental sequence is, up to a constant, the 
difference of two exponential functions, while the companion is the sum of 
these two same functions. The sine and the cosine functions, respectively, 
obey the same property. This explains the title of the famous memoir of Lucas 
[12]. As Lucas had observed [12, Sect. II], this remark provides an explanation 
for the existence of a correspondence between trigonometric identities, or 
hyperbolic identities, and the Lucas identities involving the sequences U and 
V. In some substantial way, the arithmetic of a Lucas pair {U, V} constitutes 
a discrete version of trigonometry. 


For some results, it is either necessary or convenient to assume that all 
terms U,,, n > 1, are nonzero. Such sequences are said to be nondegenerate. 
On the contrary, if some U, = 0 with n > 3, then U is called degenerate. 
A complete description of degenerate cases is done in Sect.2.11. Had we 
allowed the parameter P to be 0, all our theory would have carried over; but 
if P = 0, then U2, = 0 and Uan41 = (—Q)”. One subsequence is constant and 
the other geometric. Doing without this degenerate case and, thus, assuming 
P =U, #0 does no harm. 


+ Lucas uses the capital Greek letter A for the discriminant. However, for better homo- 
geneity with the Roman letters Un, Vn, P, and Q, we prefer the Roman letter D. 


14 2 Basic Theory of the Lucas Sequences 


If D is zero, then the double root a is the integer P/2. In this case, the 
closed forms become 


U,=no"* and Vp, = 2a”. (2.6) 


Note that one may obtain (2.6) using the Binet form (2.5) for U,, and V,. 
Indeed, U,, is a symmetric polynomial in a and § since 


so that taking the limits limg_,. Un and limg-,q V;, in (2.5) yields (2.6). 


Whenever D # 0, we refer to U (and V) as being nonzero-discriminant or 
standard Lucas sequences. Thus, nonstandard Lucas sequences satisfy (2.6), 
whereas standard ones satisfy (2.5). 


Example 1 The pairs {(I,),2}, {(Mn), (2" + 1)}, and {(F;,), (Zn)} are par- 
ticular {U,V} pairs. Here, I, := n is the identity sequence, M,, := 2” — 1 is 
the sequence of Mersenne numbers, and F' denotes the Fibonacci sequence 
and L the sequence of Lucas numbers. The three corresponding values of 
(P, Q) are, respectively, (2,1), (3,2), and (1, -1). The numbers a which 
appear in Gaussian or g-binomial coefficients, if one sets the parameter q to 
be a fixed integer, form the Lucas sequences U(q + 1,q).! 


2.2 Most Common Identities 


Because of the correspondence with trigonometry, identities are plentiful. 
The purpose of this section is only to present those identities which appear in 
many contexts. They are the most important. These include the Pythagorean, 
the double-angle, the addition, the subtraction, and several multiplication 
formulas. The latter three give Xmin, Xm—n, and Xmn in terms of Xi», 
X*, Xn, and X*, where {X, X*} = {U,V}. All the formulas shown in this 
section except for the identity (2.42) can be found in Lucas’ work, and we 
occasionally point out exactly where. 

The two Binet forms (2.5) and (2.6) permit us to extend the definition of 
Lucas sequences to negative indices. 

Thus, we obtain formulas that correspond to sin(—x) = —sina and 
cos(—2a) = cosx, namely, for all integers n 


+ Nowhere in Lucas’ writings is there a hint of his being aware of Gaussian coefficients 
even though one could have fancied his use of the letters P and @ came from there. 

* Doing so corresponds to running the recursion (2.1) backward. Indeed, (x — a~!)(x — 
B-) =2?-Q-!Px2+Q7!. But U_(n42) = Q71PU_(n41) — Q-1!U_» is equivalent to 
U_n = PU_n-1 — QU_n_-2. 
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Q’U_,»=—U, and Q"V_»n=Vp. (2:7) 
The former formula in (2.7) is seen to hold because, if D 4 0, then 
6Q”U_n = (aB)"(a™ — B™™) = B® — a” = —6Un, 
while, if D = 0, then 
Q"U_n = 0°". (—n)a "1 = —na”} = -U py. 


Similarly, 
Q"V_n = (aB)"(a-” ae Bo) = ge te a" = \. 


The Pythagorean formula cos? x + sin? « = 1 is embedded in the form 
V2 — DU? = 4Q”. (2.8) 
From 6U, = a” — 6" and V, = a” + B", we obtain the two identities 


Va + dU, = 20”, 


2.9 
Va — 6U, = 28", 22) 


which when multiplied together yield (2.8). If a = 6, ie., if 6 = 0, then (2.8) 
is readily seen to hold. 
In correspondence with the trigonometric identities 
sin(x + y) =sinzcosy + sinycosz, 


cos(a + y) = cosxcosy — sina sin y, 
we have the so-called addition formulas of Lucas [12, Sect. IX] 


WU racin = Um Va + Un Ven: (2.10) 
2Vinen = VimVn + DU mUn. (2.11) 


The subtraction formulas are 


2Q°Um=n = UmVn = UnVm; (2.12) 
2Q"Vin-n = VinVa — DUmUn- (2.13) 


To prove these or other identities, we may use the Binet formulas for 
U, and V,,, but another method which we will refer to as the RIC method 
(RIC standing for “recursion plus initial conditions” ) is often handy. If there 
are several variables as in (2.10) where we have the two variables m and 
mn, we fix one of the variables to an arbitrary value. Then we compare the 
recursions satisfied by both the left and the right sides of the equation. If 
those recursions agree and have order r, then we compare initial conditions 
or the coincidence of any r consecutive values. For instance, for (2.10), say 
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we fix m. Then the functions n+ 2U,4, and n> Uy, + Vn + Vin» Un both 
satisfy the second-order recursion (2.1). We compare initial conditions, say 
at n = 0 and n = 1. For n = 0, both sides are equal to 2U;,. Replacing n by 
1 in (2.10), we obtain 2U,,41 = PUm + Vin. But this identity itself is easily 
checked by RIC. Both sides are functions of m that satisfy the same order- 
two recursion and take the values 2 and 2P for the respective values 0 and 1 
of m. This proves the summation formula for fundamental Lucas sequences. 
The RIC technique works just as well for identity (2.11). 

The subtraction formulas are a direct consequence of the addition formu- 
las. For instance, using (2.10) and (2.7), we find that 


2Q"Um—=n = gO Umea) = Q”(UmV_n + U_nVm) 


Remark 3 At this point, we wish to clarify the extent to which all formulas 
in this section are valid. They hold for all Lucas sequences including degen- 
erate, zero-discriminant, or nonregular cases. Moreover, most are valid for all 
integral indices, not just nonnegative indices, unless otherwise stated. Also, 
the limit argument by which we obtained the Binet forms (2.6) from (2.5) 
may seem somewhat dubious: On what 6-support do we take the limit as 
B — a? Besides, a is often not a fixed target as 6 changes. But the argu- 
ment can be made rigorous. Indeed, a formula involving U and/or V terms 
at a few indices as well as possibly P, Q, and D can be transformed into a 
two-variable expression in a and ( equal to 0. Once the formula is proved 
for D # 0, then we may consider the polynomial in two variables that results 
from putting z = a and y = 8. Occasionally, it may not be a polynomial, but 
by a small manipulation, it can be made to be a polynomial, e.g., ifn <0 in 
(2.12), then multiply out by Q~”. Thus, we have a polynomial equal to 0 for 
infinitely many values of x and y. Therefore, it is zero for all values of x and 
y, and, in particular, the identity holds for « = y. Although checking out the 
formulas in the D = 0 case is often easier than the D 4 0 case and may be 
fun to perform, it is not necessary. 


Subtracting side to side (2.12) from (2.10) yields 


Um+n = VinUn a Q’Um_n; (2.14) 
whereas the subtraction of (2.13) from (2.11) gives 


Vintn = DUmUn + Q"°Vin—n- (2.15) 
Also adding side to side (2.12) and (2.10) yields 


Similarly, the addition of (2.13) and (2.11) gives 
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Vintn = VmVn — Q°Vin—n- (2.17) 
On interchanging n with m in, respectively, (2.16) and (2.17), we obtain 
for either X = U or X = V 
Ns Ve Mi OO aes (2.18) 
In particular, replacing n by m(n + 1) in (2.18) yields 


When m = 1, we recover the basic defining recursion of the Lucas pair 
(U,V). But it also says, given any m > 2 and provided U,,, 4 0, that the se- 
quences U! = Umn/Um and V, = Viny form a derived pair of Lucas sequences 
associated with the parameters P’ = V,, and Q’ = Q™. Their characteristic 
polynomial x? — V,,2 + Q™ has zeros a” and 3”. 

In particular, we obtain the useful identities 


and 
Vinn(P, Q) = Vi (Vim, Q”). (2.21) 


Remark 4 We observe that the two identities (2.20) and (2.21) are straight- 
forward to see. Assume a™ 4 6™. Then 
qin — ee _ qm — Bo (a™)* _ (gm 


Umn( P,Q) = = ae es ee ae (2.22) 


whereas 
Vinn(P, Q) = (a’™)" + (B"")" = Va(Vn, Q™). 


Putting m = n in (2.14), or in (2.10), we obtain the so-called double-angle 
identity 
U2n, = UnVn- (2.23) 


By the Binet formula for V and the Pythagorean identity, we readily find 
identities that correspond to cos(2r) = 2.cos? « — 1 = 1 — 2sin? x, namely, 


Von = V2 —2Q" = DUZ +2Q". (2.24) 
Adding V,? — 2Q” to DU? + 2Q” yields 
2Von, = V2 + DUP, (2.25) 


standing for cos(2a) = cos? x — sin? x. 
Replacing m by n and n by 1 in both (2.14) and (2.15) yields 
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Un4+1 aa QUy-1 = V,, and Vn-+1 — QVn-1 = DU,. (2.26) 


The famous Cassini! identity F,_1F 41 — F? = (—1)" for the Fibonacci 
numbers has a more general form for U sequences, and an analogous identity 
exists for sequences V. These are 


UW? —Up Ups = Q™ and V2 -VeuVesi =—-—DQ*". | (2.27) 


They can be proved by various techniques, e.g., using the Binet formulas, 
or an inductive reasoning, or matrices. For instance, if D 4 0, we see that 


D(Up — Un—1U npr) = (a — 6")? — (a8 — B*Y(a"t — pm?) 
= -2Q" + (a? + B°)Q"™ 


= -2Q" + (P?— 2Q)Q""" = DQ", 


which yields the first identity on cancelling out D. 
Because, as can be verified by induction, one has for all integers n 


P -Q ‘< _ Un+1 —QU,, 
1 0 ~ Un —QUy-1 , 

we may equate the determinants on both sides and see that 
Q” = —Q(U n4i1Un-1 a Us) 


yielding again the first identity in (2.27). 
We now give one pair of multiplication formulas of Lucas [13, Sect. 183, p. 
319] valid for all integers m > 0 and all integers n 


[(m—1)/2] 


m 
gm-liy — DrU2zKt1ym—2k-1 2.28 
Sage BURN, 2.28) 
and 
|m/2| a 
ae one. (2.29) 
k=0 


The two identities (2.28) and (2.29) are usually proved by either sub- 
tracting or adding the two sides of (2.9) raised to the mth power. Thus, for 
instance, with a subtraction, we obtain 


+ The Fibonacci numbers were known to a number of early investigators, but not under that 
name. They were often discussed as being components of the convergents to the continued 
fraction expansion of the golden ratio: (1 + V5)/2. 
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(2a”)™ _ (28")™ = (Vin T oun) ~~ (Vn _ 0U,) 


k=0 


m 
=—2 yoo. 
& Cone) enna 


which after division by 26, assuming D # 0, yields the multiplication formula 
(2.28). 

Lucas [13, Sect. 179, p. 312] had observed that U,, is a homogeneous poly- 
nomial of degree n — 1 in P and Q, while V,, is a homogeneous polynomial 
of degree n in P and Q, provided in both cases the degree in Q is counted 
with double weight. That is, U, and V, are homogeneous polynomials in P 
and /Q of respective degrees n — 1 and n, where powers of \/Q are always 
even. We provide these polynomials in P and Q for 1 < n < 6 in Table 2.1. 


Table 2.1 U, and Vy, as polynomials in P and Q 
n Un Vn 
I I P 
2 P P? —2Q 
3 P?—Q P? — 3PQ 
4 P® — 2PQ P* — 4P?Q + 2Q? 
5 P* — 3P?Q4+ Q? P® —5P°Q+5PQ? 
6 P® — 4P3Q + 3PQ? P® — 6P4Q + 9P2Q? — 2Q3 


In general, we find for all n > 0 


where 


L(n—1)/2] Sa ih 
B= IPQ) = DY Coe )prteigt — @ai 


Actually, if (X,,) satisfies the recursion (2.1), then it follows from the 
identity Y,42 = PW,41 — QW, and induction on n that for all n > 1, 


Xn = W,X1 — QWy_-1X0, (2.32) 
where we adopted the convention YW = Up = 0. 
Hence, V,, = PW,, — 2QW,,-1, which, using 


Oo ae eee (2.33) 
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yields V,,, for all n > 1, as the homogeneous polynomials in P and /@Q on 
the right-hand column of Table 2.1, i.e., 


The multiplication formulas (2.28) and (2.29) have the peculiarity of bear- 
ing a power of 2 in front of Um» and Vinn. But Lucas [12, Sects. XII-XII]], 
[13, Sect. 183, p. 318] also considered another set of multiplication formulas 
expressing Ujyn/Un and Vinn as polynomials in V, and Q. These formulas 
extend the double-angle formula (2.23). They correspond to the de Moivre 
formulas giving sin(ma) and cos(mz:) as polynomials in cos x, as Lucas himself 
pointed out [12, Sects. XII, p. 208]. Thus, for m > 1 and n > 1, we have 


Umn = Um 


Lim<D/2l 7 pg 
k 


Jienkgenv, (2.35) 
k=0 


and 


[n/2| ye: tee B 
k 


) (1° Orv. (2.36) 


To derive (2.35), one can use the identities (2.30) and (2.31) with P = Vin 
and @ = Q”™. Hence, 


(Vins) = Soy (" Wa = re 


k 
kee 


We note that (2.34) can also be viewed as a particular instance of the 
formal polynomial identity in A and B: 


tate n n—k 
An Br= —1 k 
bs dot ) —=( k 


)(aBya + Byr-2*, (37) 
k=0 


(Identity (2.37) can also easily be derived by appealing to an instance of the 
Girard-Waring formula—see, for example, [6].) Lucas used (2.37) to produce 
yet another pair of multiplication formulas. Replacing n by m in (2.37), and 
then putting A = a” and B = —£", we get for odd m 


(m—1)/2 


m m—k ei 3 (DRE es 
a= 2 ‘ Jar Dye ate eee (2.38) 


m—k 


and for even m 
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m/2 


m m—k 
Vian = kn py(m—2k) /2 m—2k 2. 
er . )e ur (2.39) 


Also, 2”-1U,, and 2”—'V,, can be expressed as homogeneous polynomials 
in P and D, if degrees in D are doubled. Indeed, 


a? Ox :) pr—2k-1 pk (2.40) 
k>0 
and 
on = @ pn-2k pk (2.41) 
k>0 


As Lucas did [13, p. 317], this can be proved by expanding the binomial 
formula (2.4) on a and J, ie., 


2”6U, = 2"(a” — 8”) = (P +6)” — (P— 6)” 
= TV) pn-k sk \ Enki s\k 
“E()roe-E (ees 
k k 
_ TU) on-k sk _ nm n—2k—-1 pk 
2 (i)? 6 = 295 (1)? D*, 
k odd k>0 


which upon division by 26 yields (2.40). Formula (2.41) can be obtained by 
expanding similarly 2"V,, = (P +6)” + (P— 0)”. 
Another useful identity, valid for all integers m and n, is given below 


Un = ntiUm—n - QU;Um—n-1:- (2.42) 


The RIC method works well. (Fix m to an arbitrary integer value. Then 
check the identity for the two consecutive values m and m+ 1 of n using 
(2.7).) We observe that (2.42) is also often used in the equivalent form 


Um+n = UmUn+1 _ QUm—1Un.- (2.43) 
Putting m = n+ 1 in (2.43) yields the identity 


Vensi = Uns — QUa: (2.44) 


which for the Fibonacci sequence F’ = U(1,—1) gives the classical formula 


Ponti = FR + Frat (2.45) 


Not to prejudice the V-side of Lucas sequences, we provide an identity 
somewhat like (2.42), but for companion sequences, namely, 


Vin = ntiVm—n = QUrVn—n-1- (2.46) 
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2.3 General Divisibility Properties 


Although the Lucas sequences are second-order linear recurrences and as 
such are defined additively, they, perhaps surprisingly, possess divisibility 
properties. In fact, these multiplicative properties form an important key 
feature of these sequences. 

A sequence of integers (Xn)n>o0, satisfying Xo = 0 and 


min => Xm | Xn, (2.47) 


for all 0 < m < n is said to be a divisibility sequence, or less frequently a 
divisible sequence. Note, in particular, that if X,, = 0, then X, = 0 for all 
multiples n of m. 

We assume the integers m and n to be nonnegative throughout this section. 


Lemma 1 The parity of Un and V,, n> 0, ts given by the rules: 


Q even and P odd = > all X,, n> 1, are odd. 
Q and P even => all Xn, n> 0, are even, except for Uj. 
PQ odd => 2|X, <> 3] n. 
Q odd and P even = > 2| Vz, for alln >0, and 2|U, <= > 2|n, 


where X stands for U or V. 


Proof These implications are readily seen by applying the recursion (2.1) 
modulo 2. For instance, for the last two where @ is odd, we find that if P is 
odd, then both U and V (mod 2) are 


OG A ea OR 


In particular, 2| X, <> 3|n, where X is either U or V. If P is even, then 
all V terms are even, while only even-indexed U terms are even. 


Theorem 2 The sequence U = U(P,Q) is a divisibility sequence, i.e., 


m|n = > Um | Un. 


Proof This is a direct consequence of identity (2.20) or of formula (2.35). 


Remark 5 An alternate proof of Theorem 2 may be obtained from the multi- 
plication formula (2.28). Say n = km. Then applying the formula to Upm, we 
see that U,,, divides 2*—1U,,. Thus, if U,, is odd, then the theorem follows. If 
Um is even, then, by Lemma 1, V;, is also even. Hence, each term of the sum 
DG Vl (,* ,) Diuz+1Vk-2-1 is individually divisible by 2-1Ujn. 


If a > 2 is an integer, then it has been known at least since the time of 
Fermat that 
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sed(a"™ —1,4" —1) = gO") — 4, (2.48) 


Indeed, without loss of generality, we may suppose m > n > 0. Say m = qn+r 
with 0 <r <n. To show (2.48) holds, it suffices to check that 


gcd(a”™ — 1,a” — 1) = gcd(a” — 1,a" — 1). 
From the identity 
a” —1=alt" —1 =a"(a")4 —a" +a" —1=a"((a")?—-1) +0" - 1, 


we see that any common factor of a” — 1 and a” — 1 is also a common factor 
of a” — 1 and a” — 1 and vice versa. 

After dividing equation (2.48) through by a—1, we obtain that U(a+1,a) 
is a strong divisibility sequence, i.e., a sequence of integers (X;z):>0 such that 
for all m > n > 0, we have 


gced(Xm, Xn) = |edtoa nil (2.49) 


The next theorem pins down a necessary and sufficient condition on P and 
Q which makes U(P, Q) strongly divisible. 


Theorem 3 For all m > n> 0, the sequence U(P,Q) satisfies 
gced(Um, t) = (Ef sccttin se) if and only if gced(P, Q) =1. 


A Lucas sequence U(P, Q) with gcd(P, Q) = 1 is called regular. 
Before proving Theorem 3, we establish a few useful lemmas. 


Lemma 2 /fU is regular, then 


gced(Un, Q) = ged(Vn, Q) =1, 
for alln > 1. 


Proof Let X be U or V. By (2.1), we see that Xn42 = PXn+1 (mod Q), 
(n > 0). As Uj = 1 and V; = P, a simple induction shows that U, = P"~! 
(mod Q) and V,, = P” (mod Q) for all n > 1. Thus, if a prime p divides 
both X, and Q, then p | P”. Thus, p | gcd(P,Q). This contradicts the 
hypothesis. 


Lemma 3 Let U, V be a pair of Lucas sequences. Then for alln > 0, 
ged(Un, Vr) | 2Q'/7!, 
Proof The lemma is a direct consequence of the Pythagorean identity (2.8) 


V7 — DU? = 4Q”. 
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Lemma 4 /f U is regular, then for all n > 1, 

gcd(Un, Vn) = 1 or 2 
and gcd(Un, Vn) = 1, if, in addition, Q is even. 


Proof By Lemma 2, no factor d > 1 of Q may divide U,,. Hence, we see the 
result holds by Lemma 3. 


First proof of Theorem 8. If gced(P, Q) > 1, then ged(P, P?—Q) = gcd(U2, U3) 
> 1, while Ugcai2,3) = U1 = 1. Thus, regularity is necessary for U to be 
strongly divisible. 

We now show regularity is sufficient, and thus, we assume gcd(P, Q) = 1. 
By Theorem 2, Ugca(m,n) | g¢d(Um, Un). Thus, assuming as we did prior to 
the theorem, that m = qn+r,0< r<_n, by the Euclidean algorithm, all 
we need to show is that G := gcd(Um,U,,) divides U,.. Since m — r = qn, we 
obtain by the subtraction formula (2.12) that 


V,Um — 2Q"U gn = VinUr- (2.50) 


Since G | U,, and, by Theorem 2, G | Ugn, we find that G | V,,U,. If 
gcd(Um, Vn) = 1, then G is prime to V,, so G divides U,.. Thus, by Lemma 4, 
we may now assume @ odd and gcd(Um, Vin) = 2. Let us write G = 2”g with 
g odd. As g is prime to Vin, g | U,. Thus, it remains to see that 2” | U,., 
assuming v > 1. 

We use Lemma 1 when Q is odd. If P is odd, then as both U,, and U,, are 
even, m and n are multiples of 3. Thus, 3 | r. Hence, 2 | U,. If P is even, then 
the evenness of U,, and U, means that m and n are even. Thus, r is even 
and again 2 | U,. There is nothing more to prove if vy = 1. Hence, suppose 
vy > 2. Then 4| U,,. So 2|| Vm. But whatever the parity of P, 2 | U, implies 
V, is even. Hence, 2”*+ divides the left-hand side of (2.50) and so 2” | U,. 

However, a simpler proof of the core part of Theorem 3, i.e., of the fact that 
P and Q are coprime implies gcd(Um,Un) = +Ugca(m,n), is often achieved 
using identity (2.42). But we need another preliminary lemma. 


Lemma 5 If gcd(P,Q) = 1, then gcd(Un, Un+1) = 1 for alln > 0. 


Proof We proceed by induction. Indeed, ged(Up, U1) = 1. Say ged(Un, Un+1) 
= 1 for some n > 0. If ged(Un+41, Un+2) > 1, then some prime p divides Un+1 
and Unie = PUn+1—QUy. Thus, p | QU;,. But by Lemma 2, p | U,,+41 implies 
pt Q. Hence, p | U;,, which contradicts ged(U,, Un41) = 1. 


Second proof of Theorem 3. To prove that gcd(Um,Un) = +Ugca(myn); it 
suffices to show that gcd(Um, Un) = gcd(Um_n, Un). But by (2.42), 


gcd(Um, Un) — gcd(Un41Um—n = QU,Um-n-1; Un) 
= gcd(Un41Um—n, Un) = gcd(Um_—n; Un), 
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where the last step follows from Lemma 5. 


Companion sequences are not divisibility sequences. Nevertheless, they 
satisfy a weaker divisibility condition given in Theorem 4. This theorem gen- 
eralizes the well-known fact that for a and b integers and m odd, we have 


a” +b” divides a™” + 0™", 
Theorem 4 Let V be a companion Lucas sequence. Then 


m odd => V,,| Vinn- 


Proof By (2.33), all terms of the sum appearing on the right-hand side of 
(2.36) are integers. Hence, for m odd, we see that V,, divides V, 


Second proof of Theorem 4. We can also wrestle out a proof from the multipli- 
cation formula (2.29). Indeed, (2.29) tells us that V,, divides 2”~!Vinn. If Vn 
is odd, then V,, | Vinn- If V;, is even, then either P is even, or both P and Q are 
odd by Lemma 1. If P is even, then 4 | D so that each term of the sum (2.29) 
being a multiple of D*V,”—?*—1Y,, is a multiple of 4*2™-2*-ly,, = 2™~1y,,. 
If PQ is odd, then, by Lemma 1, 3 | n and 2 | U,. But each term of the 
right-hand side sum of (2.29) contains the factor U2*V,"—-?k-1V,, which is 
divisible by 2”-1Vp. 

Third proof of Theorem 4. Using congruences in the ring of integers O of the 
root field Q(a) and bearing in mind that OM Q = Z, the proof is simple. 
Indeed, V, = a" + 8" = > a” = —6" (mod V,,). Raising the congruence to 
the odd mth power, we obtain a” = —8"™” (mod V,,). That is, Viz | Vinn- 
Fourth proof of Theorem 4. By Galois theory, any symmetric polynomial 
expression in the algebraic integers x = a” and y = 8” must be a rational 
integer. Now 


-1 m1 
gm 1-— A Y=), ( a 1 ty? 
1=0 1=0 
Since (—1)' = (—1)™-1~#, the sum >" (-1)'a™-1-‘y# is a symmetric 
i=0 y y 


polynomial in x and y. Thus, V;, | Vinn- 

There is also a weaker form of Theorem 3 which applies to companion 
sequences when we consider indices m and n that belong to the same 2-adic 
class, i.e., that have the same 2-adic valuation. 


Theorem 5 Let gcd(P,Q) =1, m>n> 1. Then 


mr-yen => gcd(Vin, Vn) = [Vecdtmaseh 
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We postpone the proof of Theorem 5 till Sect.2.9, where it is a direct 
consequence of Corollary 30 and of the fact that no regular prime dividing Q 
may divide any V term. 


It is often useful to be aware that Theorem 3 remains nearly true when 
gcd(P, Q) > 0. A prime p is said to be regular if and only if p+ gcd(P,Q). A 
prime that divides gcd(P, Q) is called special. 


Lemma 6 Suppose p is a regular prime. Then either p{ Q, orp | Q and does 
not divide any Un, n> 1. 


Proof If p| Q, then pf P since p is regular. Therefore, as seen in Lemma 2, 
U;, = P™~! (mod p). The conclusion follows. 


Lemma 7 If p{ Q, then gcd(Un, Un41) is prime to p for alln > 1. 


Proof This follows by induction for p { U, and if p { gcd(Un,Un41), then 
pt gcd(Un41, Un+2). Indeed, if p | Un41 and p | Un+2, then p | PUn+1— QUn, 
which implies p | U,, and leads to a contradiction. 


Theorem 6 Let U(P,Q) be a fundamental Lucas sequence and p be a regular 
prime. Then for allm >n> 0, we have 


gcd(Um,Un) ~ Uged(m,n) (base p). (2.51) 


Proof Suppose p | Q. Then no U,, n > 0, is divisible by p by Lemma 6. 
Hence, (2.51) holds. Thus, assume p { Q. Because Ugea(m,n) divides Um and 
Un, we find that vp)(Ugca(myn)) < Yp(gcd(Um, Un)). So it suffices to see that 
p®* | ged(Um, Un) implies p® | U;,—n. As in our second proof of Theorem 3, we 
use identity (2.42) by which p* | U,41Um-—n. By Lemma 7, p { Un4i. Thus, 


Oo | Ussas 


Corollary 7 If @ > 2 is an integer prime to Q, then for all m > n> 0, 
£| gcd(Um,Un) => €| Uged(m,n): 


Proof By the hypothesis, all prime divisors of @ are regular. So by Theo- 
rem 6, if p | ¢, then the p-adic valuations of gcd(Um,Un) and of Ugca(myn) are 
identical. Thus, the corollary must hold. 


Definition 1 (Rank of appearance) Let m > 2 be an integer. We define 
the rank, or the rank of appearance of m in U, denoted as p = py(m), to 
be the least positive index ¢ such that m | U;, whenever such an integer p 
exists.! 


+ Lucas used the letter w for the rank, maybe, or maybe not, in connection with the period 
of elliptic functions. We prefer p, the Greek r, for rank and reserve the Roman r for a 
variable name. 
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Lucas called it in French le rang d’apparition de m, and the French word 
“apparition” has been—and still is—widely in use among English writers. 
However, in English, more so than in French, the word “apparition” has a 
ghostly or miraculous connotation which is best to avoid: There is nothing 
supernatural about how integers divide terms of Lucas sequences. The rank 
p is sometimes called the entry point of m in U. In this case, the letter z often 
denotes the entry point. 


Lemma 8 If m > 2 is an integer coprime to Q, then the rank p(m) exists. 


Proof We need to show the existence of a positive integer t such that U; = 0 
(mod m). The sequence (U;, (mod m)) may only take at most m? distinct 
pairs of consecutive values. Therefore, from some point on, it must repeat 
itself going forward and be periodic since U (mod m) is determined by two 
consecutive values of (U, (mod m)). However, because gcd(Q,m) = 1, the 
inverse of Q (mod m) is well defined, and the recurrence (U;,, (mod m)) may 
be run backward as well as forward. Indeed, U_(n42) = PU nat) — 
Q-!U_». Thus, it is fully periodic over Z. Since Up = 0, the value 0 (mod m) 
must repeat itself in both directions. In particular, there is a minimal t > 0 
for which m | U;. 


We now establish an important theorem concerning p(m). This theorem 
does not bear a traditional name, but we refer to it as the law of regular 
division since it says that a regular divisor m divides the terms of the sequence 
U precisely every p term. 


Theorem 8 [fm > 2 is prime to Q of rank p, then 
m|Un => p|n. 


Proof Clearly, m | U;, if and only if m | ged(U,,U,). By Corollary 7, this 
occurs if and only if m | Ugca(n,p). By the definition of p,m | Ugcacn, p) is true 
if and only if p = gcd(n, p), ie., if and only if p | n. 


2.4 Divisibility of U, by Powers of an Odd Prime 


From Lemma 8, if p { Q is a prime, then we know all powers p*, a > 1, do 
divide some terms of U(P,Q). Moreover, we know which terms are divisible 
by a given prime power p* once we know its rank by Theorem 8. In this 
section, we gather more information on the values of the ranks of primes and 
see precisely how the rank of a prime power p**? relates to the rank p(p*) of 
the previous power p*. 

If p is special, then p divides all U,, n > 2. If p| Q, but p{ P, then no U,,, 
n > 1, is divisible by p by Lemma 6. To express the law of appearance for 
primes not dividing Q, we use the symbol e, to denote the Legendre character 
(D |p). That is, 
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0, if p| D; 
€» = (D|p) = 41, if D is a nonzero square modulo p; (2.52) 
—1, if Dis not a square modulo p. 


We begin with a lemma. 


Lemma 9 /[f p is an odd prime, then 

Up=& (modp) and V,=P (mod p). 
Proof As the binomial coefficient (oe) is divisible by p unless k = (p—1)/2, 
we obtain by setting n = p in (2.40) that 2°-!U, = D°z~ (mod p). This 
implies U, = € (mod p). Similarly, (2.41) gives 2?-1V, = (8)P? (mod p), 
yielding V, = P (mod p). 


Remark 6 Rather than using Equations (2.40) and (2.41), we may use con- 
gruences in the ring of integers O of the field Q(a). Indeed, V, = a? + 6? = 
(a+ 6)? = P? = P (mod pO) implying V, = P (mod p) since V, and P are 
rational integers. Likewise, assuming p{ D, we find that 


Uy = GaP - (a — gy! = DF = ey (nod 70). 


Theorem 9 (Law of appearance for primes) Suppose p{ Q is a prime 
of rank p. Then we find that 


if p> 3, then p dividesp—€p, and 
if p= 2, ten = | ei 
3, aif P is odd. 

Proof The case p = 2 is immediate as U, = P and U3 = P? — Q. Suppose p 
is an odd prime. We are about to see that p | Up—c,. If €p = 0, then p | Up by 
Lemma 9. If ep = +1, then Equation (2.12) gives 2Q°Up_<, = UpVe, —Ue, Vp- 
That is, for €, = 1, 2QU,_-1 = PU,—V, = Pe»—P =0 (mod p), by Lemma 9. 
For €) = —1, we get 2Q7~'Up41 = U,pV_1 — U_1V,p, which when multiplied by 
Q yields 2Up41 = UpVit Vp = &P+P =0 (mod p), by (2.7) and Lemma 9. 
Thus, by Theorem 8, we conclude that p divides p — ep. 


Remark 7 The content of Theorem 9 can be found in [12, Sect. XXV] which 
Lucas titled “De V’apparition des nombres premiers ...” So Theorem 9 is 
Lucas’ so-called law of appearance. 


Remark 8 One may view the law of appearance for primes as a near general- 
ization of Fermat’s little theorem, that is, of the Fermat congruence a?~+ = 1 
(mod p), for integers a such that p{ a. If U = U(a+1,a) and p{ a(a— 1) is 
an odd prime, then D = (a — 1)”, €p = 1, and the law of appearance implies 
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that a?~! =1 (mod p). (Since there is hardly anything to prove if p = 2 and 
a is odd, or if p is odd and p | a — 1, this law may actually be viewed as a 
full generalization of the Fermat congruence.) 


Remark 9 The law of appearance when €, = —1 may be seen via algebraic 
number theory. The non-trivial automorphism o of Q(a)/Q, which sends 
a+» 6 and vice versa, is the Frobenius automorphism of pO/p. That is, 
a(x) = z? (mod pO) for all 2 € O. Thus, Q = a8 = aa? = BB (mod pO). 
But, by (2.5), a?*! = B?*! (mod pO) holds iff p | Up41. 


We now turn to the appearance of prime powers. 
Lemma 10 For m and n nonnegative integers, we find that 
2°" an = mU,V-* (mod U2), 


Proof This is an immediate consequence of the multiplication formula (2.28) 
since modulo U3, only the term corresponding to k = 0 survives. 


We are now ready for a famous lemma which, essentially, asserts that the 
appearance of higher powers of p in Um, is governed by the power of p in m. 


Lemma 11 (Lifting-the-exponent lemma) Suppose p { 2Q is a prime 
that divides U,,. Then 
Umn ~ mU,, (base p). 


Proof We proceed by induction on ¢ where m ~, p‘. If €=0 or = 1, then 
we can downgrade the modulus in Lemma 10 to p?U;,, and obtain that 


27 — Un = mU,V+ (mod p*U,). 


By Lemma 3, p{ V,"—'. Thus, v,(mU,,V,"—1) = vp(mU,,) < vp(p?U,,) so that 
Umn ~p MU,,. Thus, the lemma holds for ¢ < 1. In particular, Up, ~p pUn 
for all n > 0. Assume the lemma holds for some ¢ > 1. Put m = kpt!, pt k. 
Then, as Upn ~p pUn, we may use the inductive hypothesis and write that 


Umea = Unsiigny © EB Up, © kp "Uy, =U, (base). 


Remark 10 Lemma 11 is essentially what Lucas [12, Sect. XIII] twice called 
“la loi de la répétition,” which translates into “the law of the repetition” (of 
the prime numbers). Thus, in English, for many authors who have worked 
with the Lucas functions, Lemma 11 is referred to as the law of repetition. 
Note that on [12, p. 210] of that same Sect.13, Lucas also refers to this 
phenomenon as the “appearance of the successive powers of prime numbers.” 


Example 10 In the Fibonacci sequence, p(3) = 4. Thus, 3| F,, if and only 
if 4| n. Moreover, Fym ~3 mF, = 3m. For instance, 3° || F3¢. 
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Remark 11 Theorem 9 and Lemma 11 provide respective proofs that a prime 
pt Q possesses a rank and that this is equally true of all its powers. These 
proofs are independent of the argument used in Lemma 8. Indeed, Theorem 9 
shows that p | Up—<,, and the proof of Lemma 11 does not refer to Lemma 8. 


Ep? 


Corollary 11 Suppose p { 2Q is a prime of rank p in U, and assume p’” || Up. 


Then 
(p*) = Pp, ifl<a<vp; 
a a pp, ifa>v. 
Proof If 1 <a< vy, then no term U; with 0 < t < p is divisible by p. Thus, 
p(p*) = p. Ifa > v, then, by Lemma 11, the least t such that p* divides U; 
is p*"p. 


Definition 2 (Rank exponent of a prime) Given p { Q of rank p, we 
define the rank exponent v of p as the full exponent of p in U,. That is, as in 
Corollary 11, p” || Up. Note that if U, = 0, then we agree that v = +00. 


Remark 12 Given a prime p, it is easy to construct a Lucas sequence U(P, Q) 
such that the rank exponent v of p is as large as we wish. A cheesy solution 
is to choose P = p”. Another would be to pick P prime to p and choose 
Q = P?—p” so that p(p) = p(p”) = 3. In the particular case of the Fibonacci 
sequence, no prime p has yet been found for which v > 2. There is a folk con- 
jecture, sometimes referred to as the Wall-Sun-Sun conjecture, which asserts 
that p? { F p(p) for all primes p. See [8] for an extensive bibliography on this 
conjecture. Although all primes <9.7 x 10’ satisfy p|| F, [4], there is little 
evidence from a probabilistic point of view for the truth of such a statement. 
The root a = 14v5 is a fundamental unit of the corresponding ring of inte- 
gers Z[a] of Q(a). We think a statistical study of rank exponents of primes 
in Lucas sequences built on other fundamental units would be welcome. 


Remark 18 In view of the importance of Fermat’s last theorem in Lucas’ 
investigations and related to the occurrence of rank exponents larger than 1, 
it is interesting to note that in 1909, Arthur Wieferich showed that if p is a 
prime and 

xP +yP = 2P (pt cyz) (2.53) 


has a solution, then p* | U,_1(3, 2). Such a prime is called a Wieferich prime, 
and the only ones known for all p < 6.7 x 10'° are 1093 and 3511. It is now 
known that if a (< 89) is a prime, then p” | U,-1(a+1,a) whenever (2.53) is 
true. (See Granville and Monogan [5].) Any prime p such that pt a(a—1) and 
p? | Up-1(a+1, a) is called a base-a Wieferich prime. An extensive computer 
search for such primes is discussed in Dorais and Klyve [4]. Of course, much 
of this has been rendered moot by the proof of FLT, but it does suggest the 
following extension of the Wieferich primes to Lucas-Wieferich primes. We 
say that if p is a prime such that p{ QD and 


p” | Up-e, (P,Q), 
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then p is a Lucas-Wieferich prime with respect to the Lucas sequence 


U(P,Q). 


However, there is one class of primes for which the rank exponent can only 
be 1. 


Theorem 12 Suppose p{ Q is an odd prime and either p > 5 and p | D, or 
p=3 and9|D. Then the rank exponent of p is 1, i.e., p? { Up. 


Proof By Theorem 9, p(p) = p. Using the identity (2.40), which expresses 
U, as a polynomial in P and D, with n = p, we find that as p > 3, 


oP-1Y, = pPP-1 + ¢ ) P?-3D (mod D?). 


Note that p | D and p { Q implies that p { P. Since p | D and p | (8) if 
p> 5, and 9| D if p =3, we see that 2?-'U, = pP?—' (mod p’). Therefore, 
Up ~p D- 


Remark 14 We observe that Theorem 12 does not hold for p = 2, or p = 3 
and 9 { D. For instance, if P = 4 and Q is odd, we find that 2 | D, but 
2? | Up. For p = 3, P = 10, and Q = 1, we obtain that 3 | D and U3 = 99. 
Thus, the rank exponent of 3 is 2. 


Remark 15 If Q = 1 and D = 0, then all primes p > 3 satisfy the hypotheses 
of Theorem 12. But then either U,, = I, = n for all n, or U, = (—1)"J, for 
all n, and the rank exponent is trivially seen to be 1. 


It remains to look at the appearance of powers of 2 in U, a case not 
addressed by the lifting-the-exponent lemma. We proceed to complete the 
picture in the next section. 


2.5 Divisibility of U, by Powers of 2 


The case when 2 is a special prime is fully addressed in Sect. 2.12. For now, 
we are content to observe, by Lemma 1, that, in that case and that case only, 
all terms U,,, n > 2, are even. 
If 2| Q and 2¢ P, then as mentioned in Lemma 1, all U,,, n > 1, are odd. 
Thus, we are mainly interested in the case 2 { Q. We state two distinct 
theorems depending on the parity of P. 


Theorem 13 Suppose Q is odd and P is even. Then for alln > 0, 


Venti ~l and Usa,~nUg=nP_ (base 2). 
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Theorem 14 Suppose P and Q are odd. Then p(2) = 3, and, for all n > 0, 
we have 


4|U3 and Us, ~nU3 (base 2), ifQ=1 (mod 4), 
and 


2||U6n+3, 8| Ue and Usn ~nUg (base 2), ifQ=-1 (mod 4). 


We begin with a few lemmas useful in proving both theorems. 


Lemma 12 Suppose Q is odd. If V; ~2 2, for some s > 0, then Vins ~2 2 for 
alln > 0. 


Proof If n = 0, then V,, = Vo = 2, and the lemma holds. If n > 1, then 
write n = (2h + 1)2". If u > 1, then, as Vs | Vionti1)s, Vi2n+i)s is even. Note 
that if 2 | V;, then 2|| Va; because Vo, = V2 — 2Q* and Q is odd. Thus, we 
see inductively that 2 || Vou(2n41)s = Vns- If u = 0, then n is odd, and we at 
least know that V,,, is even. Moreover, by the addition formula (2.11), 


2Vns = 2V(2n41)s = 2Vans+s = Vans Vs + DU ansUs. 


By Lemma 1, when Q is odd, U, is even iff Ve is. Thus, each of the three 
factors in the product Un,VisUs is even for all h. Hence, v2(DU2,,Us) > 
V2(UnsVnsUs) > 3. However, since 2h is even, 72(V2nsV;) = 1+1 = 2. There- 
fore, ¥2(2Vn5) = Yo(VonsVs) = 2 so that vo(V,,) = 1. 


Lemma 13 Suppose Q is odd. If 2| Us, then 


Uns ~Us (base 2), for all odd n> 1. 


Proof Note first that V, is even, whatever the parity of P, by Lemma 1. We 
reason inductively. The lemma clearly holds for n = 1. Suppose it holds for 
some odd n > 1. We proceed to show it holds for n + 2. Using the addition 
formula (2.10), we see that 


2U (n+2)s _ WU ns42s — Uns V25 + U2sVns- 


By the inductive hypothesis, v2(Uns) = v2(Us) and 12(V2,) = 1 since Vo, = 
V2 —2Q° and V, is even. Hence, 79(Un;V25) = 1+v2(Us), whereas U2,Vns = 
UsVsVns has a 2-adic valuation at least 2+ 12(U,) since V; | Vis. Therefore, 
¥2(2U (n42)s) =1+ V2(U,). That is, ¥2(U(n+2)s) = V2(U,). 


Lemma 14 Suppose Q is odd. If2 | U, and 2||V., then for all integers n > 0, 


Uns ~nU, (base 2). 
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Proof We prove the lemma by induction on 12(n). The base step of our 
induction holds by Lemma 13, i.e., for all odd integers n. 

Suppose U,,, ~2 nU, for all integers n > 1 with dyadic valuation k for 
some k > 0. Let m > 1 be an integer with v2(m) = k +1, and write m = 2n. 
By Lemma 12, 2|| Vis. Therefore, 


Ums = Vans = VnsUns ~ 2:nUs =mUs (base 2), (2.54) 


and our induction is complete. 


Proof of Theorem 18. As already observed in Lemma 1, 2 | U,, if and only 
if 2 | n. Thus, Uon41 ~2 1. To see that Uz, ~2 nU2, it suffices to apply 
Lemma 14 with s = 2. Indeed, 2|| V2 since Vo = P? — 2Q =2 (mod 4). 

Proof of Theorem 14. By Lemma 1, X,, is even if and only if 3 | n, where 
X stands for either U or V. 

Case 1. Q = 1 (mod 4). Then U3; = P? — Q = 0 (mod 4) and V3 = 
P(P? —3Q) = 2 (mod 4). Thus, U3, ~2 nU3 for all n > 1 by Lemma 14 with 
s=3. 

Case 2. Q = —1 (mod 4). Then U3 = P? — Q = 2 (mod 4) so 2|| Us. 
By Lemma 13, Un+1)3 ~2 U3, so 2|| Usn43. Now P? — 3Q = 0 (mod 4) so 
4| V3 = P(P? — 3Q). Hence, 8 | Us = U3V3. Also, 2|| Ve = V? — 2Q?. 

Applying Lemma 14 with s = 6 yields Ug, ~2 nUg for all n > 1. 


Remark 16 In the case Q = —1 (mod 4), there is a jump equal to 
V2(P? — 3Q) > 2 


between the 2-adic valuations of U3 and Ug larger than the expected jump of 
1 coming from the fact that 6 = 2 x 3. But starting from Ug, the increase in 
the 2-adic valuation of Ug, is again governed by v2(n), the 2-adic valuation 
of n. 


Remark 17 We note in passing that Theorem 13 can be proved directly using 
the polynomial expression (2.31) for U,,. Indeed, 


” (2n—k 7 
Con41 = evel “ ae 


k=0 


and the only odd term in this sum is (—Q)” obtained when k = n. Thus, 
Uon+1 ~2 1. Also, from (2.31), we see that 


n—-1 
2n-—k-1 -ob-1Ak 
Uan = evel i Va 2k ae 


k=0 


from what we might suspect the term (—1)”~'nPQ"~! ~2 nP = nU2, ob- 
tained when k = n— 1, to have the lowest 2-adic valuation of all terms. This 
inkling was proved in [2, p. 273], putting a = 12(P) > 0 and b = 12(Q) = 0. 
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Using Theorems 13 and 14, we state a corollary analogous to Corollary 11. 


Corollary 15 Suppose 2 { Q. Let p and v, respectively, denote the rank and 
the rank exponent of 2. If P is even or if Q =1 (mod 4), then 


7 Pp, ifl<a<vp; 
nny = 10, 
2°-"p, ifa>v. 


If P isodd and Q = —1 (mod 4), then, with vy, = v2(Ug) = 1+12(P?2—3Q) > 
3, we find that 
3, ifa = 1s 
p(2*) = 4 6, ilo Xs wy; 
2°-”*.6, ifa> vy. 


2.6 Composite Integers Dividing U and Their Rank 


By Lemma 8, we know all integers prime to @ admit a rank of appearance. 
Also, from Theorem 8, given an integer m > 2 prime to Q, we know m divides 
U,, if and only if n is a multiple of the rank of m. Moreover, if some prime 
factor of m divides Q, but not P, then by Lemma 6, no U,,, with n > 1, 
is divisible by m. We have gathered information on the rank of primes and 
prime powers in the previous two sections. We now state a theorem on the 
rank of m, m prime to Q, in terms of the ranks of its prime factors. 


Theorem 16 Let m > 2 be an integer prime to Q of rank p in U(P,Q). If 
m = |Jj_, ps’, then 


p = lem (p(p7"), p(P3?),---, ppr"))- 


Proof Let M be the least common multiple of the ranks p(pf*), 1 <i <r. 
By Theorem 8, 
m|U, <=> p|n. 


But m | U,, if and only if pj 
again, 


U, for all 1 <i <r. Hence, by Theorem 8 


m|Un => plpj')|n (alli, 1<i<r) 
<= Mn. 


Hence, p = M, i.e., M is the rank of m. 


Example 17 With respect to the Fibonacci sequence, p(3) = 4 and p(7) = 8 
so p(21) = lem(4, 8) = 8. Indeed, Fg = 21. 
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Application 18 Prove that for all n > 1, P? | P,, iff nP, | m, where 
P = U(2,~-1) is the so-called Pell sequence. Suppose The \| P, P® is the prime 
factorization of P,. Note that if p*||P,, then a > v, where v is the rank 
exponent of p. By Theorem 16 and Corollaries 11 and 15, 


p(Pn) = lempa | p, ( =( ile ) -Iemyjp, (p (p)). 
P\|Pr 


Hence, 


p(P2) = ([[ »”) 4) -lemp)p, (p (p )) 


PIPn 
=(][ 2%) -([[ 2°”) lee, (0) 
p|Pr P|Pn 


= P,,- p(P,) = Pr-n. 


Indeed, the fact that for n > 2, P, > 1 and P, is an increasing function of n 
implies that p(P,,) =n. Hence, P? | P,,, if and only if nP, | m, by Theorem 8. 
The case n = 1 is trivial to check. (If n > 3, then F? | Fi, iff nF, | m, where 
F;, designates the kth Fibonacci number. This property is a lemma famously 
used by Matijasevich in his solution to Hilbert’s tenth problem.) 


2.7 Divisibility of V,, by Powers of an Odd Prime 


If {U,V} is a pair of Lucas sequences, then, while all primes not dividing Q 
divide some terms of U, there are usually infinitely many primes which do not 
divide any term of the companion sequence V. Actually, the infinite collection 
of primes which do not divide any V, represents a positive “proportion” of 
the primes. Precise theorems on the size of the set of primes that divide a 
companion Lucas sequence will be stated in Chap. 9. 

As an example, consider the sequence of Lucas numbers L, where Ly, = 
V,(1, —1). We find that modulo 5 


Le 813421... 


That is, LZ (mod 5) has period 4, and the residue 0 (mod 5) never appears. 
Thus, for all n > 0, 5{ Ly. Similarly, for p = 13, we find that modulo 13 


be O59 4629.5. 8 58 Oa 91 9 1, 


Since Ly4 = —Lo = —2 (mod 13) and Lj; = —L, = —1 (mod 13) and 
134 L, for 0 <n < 13, there can never be a value of n > 0 such that 13 | Ly. 
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In fact, in 1985, Lagarias [9] showed that on average, exactly one out of three 
primes does not divide any Lucas number. 


Let us begin by defining the V-rank of a generic integer m. 


Definition 3 (V-rank) Let m > 2 be an integer. We define the V-rank of 
m, or the rank of appearance of m in V, denoted as o, or a(m), to be, if it 
exists, the least positive index t such that m | V;. We may occasionally use 
the alternate notation py or py(m). 


We are about to see that for all primes not dividing 2Q, there is a very 
simple criterion for the V-rank to exist. 


Theorem 19 Let p{ 2Q be a prime of rank p. Let o denote the V-rank of p. 
Then 
o exists => p is even, 


in which case 0 = p/2. Moreover, the p-adic valuations of U, and Vz are the 
same. 


Proof We begin with the proof of the converse. Thus, we suppose p is even, 
say p = 2p*. Then p{ U,«, but p| U,. Since by (2.23) U, = U,+V,*, p must 
divide V,+. Thus, 0 < p* exists. Moreover, by (2.23) again, as p | Vo, p | Ua. 
Hence, p | 2c. That is, p* | co. Thus, o = p*. 

Now assume that o exists. Then p | V, implies that p | U2,. Hence, p | 20. 
By Lemma 3 and since p { 2Q, p{ gcd(U,, V,). So pt U,. But p| 20 and pt oa 
says that p is even. (Therefore, by the proof of the converse, o = p/2.) 

That U, ~p Vo follows from the identity U, = U,V, and p{U,. 


It is easy to verify that 5 and 13 are the least two primes with no L-rank, 
where L = V(1,—1). Their respective ranks in F = U(1,—1), 5 and 7, are 
indeed odd. 

Theorem 19 also holds for prime powers. That is, the V-rank of the power 
of a prime p not dividing 2Q exists if and only if the rank of this prime power 
is even. This occurs if and only if o(p) exists. 


Corollary 20 Let p{ 2Q bea prime of rank p and rank exponent v. Suppose 
a> 1 is an integer. Then 


a(p") exists <> pis even. 
Moreover, if p is even, then 


o(p") = spu(P") = 5 PP eo, 


where [—] denotes the Iverson symbol. Also, the p-adic valuations of Upp) 
and V,(pe) are equal. 
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Proof We can clone the proof of Theorem 19, once we observe, by Corol- 
lary 11, that p(p*) = pp*, where ¢ := (a — v)[a > v}, and thus that p is even 
if and only if p(p*) is even. 


Alternatively, we can state part of the previous result in the manner of 
Corollary 11, as follows: 


Corollary 21 Suppose p{ 2DQ is a prime and a(p) exists. Then 


0, ifl<a<v; 
o, ifa>y, 


where p” || V,. 


Odd primes p dividing D have no V-rank by Theorem 9 since their rank 
p, equal to p, is odd. Hence, they are excluded twice from consideration in 
the statement of Corollary 21. 

For the powers of an odd prime with even rank, we may state a law of 
regular division analogous to Theorem 8. 


Theorem 22 Given a prime p { 2Q of even rank and an integer a > 1, we 
have for alln > 1 


p" | Vn => n= (2k 4+ 1)o(p"), 
for some k > 0. 


Proof By the double-angle formula (2.23) and Lemma 4, p* | V,, implies that 
p®* | Uan, but p* t U,. Hence, p(p*) | 2n, but p(p*) { n. Since, by Corollary 20, 
p(p*) = 2a(p*), we see that o(p*) | n, but 2o0(p*) { n. It follows that n is an 
odd multiple of o(p*). 

For the converse, we only need to invoke Theorem 4. 


Corollary 23 Suppose p { 2Q is prime and m and n are positive integers. 
Then 
mnvogn => gcd(Vin, Vn) ~p gcd(m,n)- 


Proof Say p* | gcd(Vin, V,). Then, by Theorem 22, m = (2k + 1)o(p%) and 
n = (2€+1)o(p*) for some nonnegative k and €. Thus, o(p*) divides gcd(m, n) 
and o(p*) ~2 m~2 n ~2 gcd(m,n). Therefore, by Theorem 4, p* | Vaca(m,n): 
If p* | Vecd(m,n)> then, by Theorem 4, p* divides both V,, and V, since m 
and n are odd multiples of ged(m,n). 


Regular primes dividing Q cannot divide any U term, so they cannot divide 
any V term by the double-angle formula. Thus, we now turn to the divisibility 
of V terms by powers of 2. 
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2.8 Divisibility of V, by Powers of 2 


The parity of V terms was given in Lemma 1. But the behavior of the prime 
2, when 2 { Q, is radically different from the behavior of odd primes p { Q. 
Indeed, by Corollary 20, if p is odd and o(p) exists, then powers of p in V 
are unbounded. On the contrary, powers of 2 always remain bounded in a 
companion Lucas sequence, at least in the nondegenerate case. 


Theorem 24 Suppose Q is odd. If P is even, then, for alln > 0, 
Von ~2 and Vanii~ P (base 2). 
If P is odd, then we have 2| V, <=> 3|t. Moreover, for n> 0, 
Vseon+1) = Ven¢3 ~ V3 ~~ P?-3Q and Ven ~2 (base 2). 
Proof Suppose P is even. By Theorem 13, 
Uon+1 ~2 L and Uan+2 ~2 (2n + 1)P ~2 P. 
Since Von4i1U an41 = Usn+2, we have Vony1 ~2 Usan+2 ~2 P. Also, Vo = 
P? — 2Q ~2 2 so that, by Lemma 12, Van ~2 2 for all n > 0. 

Now suppose that P is odd. That 2 | V; iff 3 | t was already observed in 
Lemma 1. Since V3 = P(P?—3Q) and P is odd, V3 ~2 P?—3Q. Thus, if Q = 1 
(mod 4), then P? — 3Q = 1+Q = 2 (mod 4), so V3 ~2 2. By Lemma 12, 
V3n ~2 2 for all n > 0. Suppose Q = —1 (mod 4). By Theorem 14, 

Usant1) ~2 2 and Usgan41) ~2 (2n + 1)U6. 
In particular, U3 ~2 2. Therefore, 


2V3 ~ U3V3 = Us ~ Usi2n41) = U3(an+1) V3(2n41) © 2V3(2n41) (base 2). 


This implies V3(2n+41) ~2 V3. 
Again, by Theorem 14, Ugn ~2 nUg. Hence, 


2nUg ~ Ujen = VenVen ~ NU6 Ven (base 2), 


from which we see that Ven ~2 2. 


Remark 18 The 2-adic valuation of V is simplest in two cases. If Q is odd 
and P = 2 (mod 4), then V,, ~2 2 for all n > 0. If PQ is odd with Q = 1 
(mod 4), then V, ~2 2, if 3 | n, and V, ~2 1, if 34 n. 


We rephrase Theorem 24 as a law of appearance for powers of 2 in the 
next corollary. Note that Equation (2.55) is reminiscent of Theorem 22. 
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Corollary 25 Suppose Q is odd. Then, for a > 2, 


(2") 1, if4| Panda < 1,(P); 
oO — 
3, if Pisodd,Q=-—1 (mod 4) anda < 12(P? — 3Q), 


and 
2°|V, => n= (2k 4+ 1)0(2"), (2.55) 


for some k > 0. Moreover, if a exceeds, respectively, v2(P) or ve(P? — 3Q), 
then (2%) does not exist. 

To complete the picture, we have for P even: 

2||V,, for all n > 0 if P=2 (mod 4) or if n is even and 4 | P. 

For P odd, we have: 

2|| V3, for all n > 0 if Q =1 (mod 4) or if n is even and Q = —1 (mod 4). 


A result similar to Corollary 23 can be proved with respect to p = 2. 


Corollary 26 Suppose @ is odd and m and n are positive integers. Then 


mroan => gcd(Vin, Vn) m2 Vecd(myn): 


Proof We use Theorem 24 throughout. Say 2° | gcd(Vin, Vi). Then a < M, 
where 
M := v9(P)- [2 | P|] +12(P? — 3Q) - [24 P] > 1. (2.56) 


If a > 2, then we can, using (2.55), carry out the same reasoning as in 
the proof of Corollary 23 to prove our result. So assume a < 1. If P = 2 
(mod 4), then all Vj, t > 0, have 2-adic valuation 1. So the result holds. If 
4 | P, then we only need to consider the case where m and n are both even; 
otherwise, a > 2. Thus, a = 1. Hence, 2|| Veca(myn) because ged(m, n) is even. 
So the result is true. Suppose P is odd. When Q = 1 (mod 4), all even V 
terms have 2-adic valuation equal to 1. Since V; is even iff 3 | t, we see that 
2|| gcd(Vin, Vn) iff 3 | ged(m,n), which occurs iff 2|| Veca(mn). If Q = —-1 
(mod 4), then 2|| Veca(m,n) iff 6 | gcd(m,n), which occurs iff V;, and V,, are 
both even with 2-adic valuation 1. 


Remark 19 It may be observed that if 4 | V, for some n > 1, then 2™ || Vj, 
where M was defined in (2.56). In fact, n is an odd multiple of a(4) = o(2™). 


2.9 Composite Integers Dividing V 


We are now concerned with integers m > 2 that are divisors of a companion 
Lucas sequence V, i.e., such that m | V, for some n > 1, which we write 
m | V. As mentioned earlier, the set of primes which do not divide V not 
only is most often infinite but contains a positive proportion of the primes. 
Therefore, the set of integers whose prime factors all divide V must have 
a 0-asymptotic density within the set of integers. A fortiori, if Dy denotes 
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the divisor set of V, ie, if Dy := {m; m | V}, then #Dy(x) = o(z). 
That is, few integers divide V. Actually, most integers with prime factors 
all dividing V are not in Dy as Theorem 27 will suggest. In fact, when 
a and § are coprime integers, a/3 #4 +1, Moree’s work [14, Theorem 5] 


provides estimates for #Dy(x). In particular, if a/8 ¢ Q’, then #Dy(z) 
2/3 


is asymptotically equivalent to cx/log*/” x for some positive constant c, as 
x — oo. The next theorem gives an idea of how stringent conditions are for 
a generic integer to divide a companion Lucas sequence. 

Theorem 27 Suppose m > 3, ged(m,Q) = 1, and m = 2°T[}_ ip is the 
factorization of m into prime factors 2 < py < pg <-+-: < ps, where a > 0, 
s>0, and b; > 1. Let p; denote the rank of pi and pj \ Up,- Then o(m), the 
V-rank of m, exists if and only if the three conditions below are satisfied: 


© V2(p1) = =+++=12(p5) = % > 1, 
if P is even; 
ven —3Q), if P is odd, 
vy=1, ifa> 2. 


In case of existence, 


1 e e e 
a(m) = 5 Icm(po, P1P1' > P2P> 5 eee iPad.” )s (2.57) 


where 


2, otherwise, 


{° if P is odd and a > 0; 
Po = 


and e; = max{0,b; — v;}. Moreover, for alln > 0, 


m| Vn => n=(2k+1)a(m), for some k > 0. 


Proof The existence of a vy, > 1 such that 


Vs = V2(p1) = V2(p2) = +++ = V2(ps) (2.58) 


is necessary for a(m) to exist. Indeed, m divides V implies all its odd prime 
factors divide V. But by Theorem 19, p; divides V iff p; is even. Note that, 
by Corollary 21, p | V implies p? | V for all b > 1. Now, in order to find an 
n > 1 such that all pe simultaneously divide V,,, m must be an odd multiple 
of all o(p?*). But o(p?') ~2 o(p;). Hence, all o(p;) must have the same 2-adic 
valuation. Since p(p;) = 20(p;), there is an v, > 1 such that (2.58) holds. If 
m is odd, or if m is even with a = 1 and P even, in which case 2 | V,, for 
all n, then the condition (2.58) is sufficient. Moreover, n is an odd multiple 
of all o(p>') if and only if n is an odd multiple of the least common multiple 
of all o(p?'), which must be o(m). Hence, o(m) is given by (2.57) in those 
cases. 
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Note that if a > 1 with P odd, then 2 | V,, iff 3 | n. So we must make sure 
that 3 | o(m) which explains the presence of pp = 6 in (2.57). If a > 2, then, 
by Corollary 25, a must be bounded above either by v2(P), if 4 | P, or by 
V2(P? — 3Q), if P is odd, in order to have V terms divisible by 2°. Moreover, 
by (2.55), n must be an odd multiple of a(2%). Since o(2°) is either 1 or 3, 
a(2*) is odd, and we must have v, = 1 in order for m to divide V,. 


Example 28 Consider L = V(1,—1). Then m = 8f L since a = 3, P?-3Q = 
4, and a > 12(P? — 3Q). However, m = 4 divides L. Here, pp = 6, (4) = 3, 
and 4 | L,, iff n = 6k + 3 for some k > 0. But 21 = 3-71 L because p(7) = 8 
has a 2-adic valuation greater than p(3) = 4. Now 319 = 11-29 | L, because 
p(11) = 10 and p(29) = 14 are both even with 2-adic valuation 1. Hence, 
o(319) = 35 and 319 | Lrox+3s for all k > 0. Similarly, 1276 = 4-319 divides 
L because v, = 1. But 284 D since vy. = v2(p(7)) =3 > 1. 


We state a corollary of Theorem 27, which is only a particular case of that 
theorem, but with a simpler statement. 


Corollary 29 Suppose m > 3, coprime to Q, is either odd or such that 2 || m 
and has s > 1 odd prime factors p;,..., ps. Let p; denote the rank of p;. Then 


a(m) exists <> V2(p1) = V2(p2) Se = V2 (Ps) > 1, 


where, in case of existence, a(m) = ern) Moreover, for all n > 0, 


m|Vn <=> n=(2k+1)o(m), for some k > 0. 
Remark 20 We observe that if m admits a V-rank o(m), then, in fact, we 
always find that 
p(m) 


a(m) = a 


Indeed, if m = 2%b, b odd, 2 <a< M, where M was defined in (2.56), then 
2, if2| P; 
Qe = ’ ’ 
pen fs if 24 P. 
Hence, p(m) = lem(p(2"), p(b)) = 21em(a(2"), 0(b)) = 20(m). 
The next result is a combination of Corollaries 23 and 26. 


Corollary 30 Let p{ Q be a prime and m and n be two positive integers. 
Then 
moran => gced(Vin, yy) ™p Vecd(m,n): 


Before ending the section, we single out, without proof, a few corollaries 
which are immediate consequences of the preceding theorems. 
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Corollary 31 If m > 3, gcd(m, Q) = 1, then 


m|V, => 2¢ 


n 

o(m) 

Corollary 32 If £ is a product of regular primes and @ | V;,, then 
nd2m => ged(é,V,) =1 


Corollary 33 If €; | Vm and 2 | Vz, where m ~2 n and ged(é;, 2) = 1, then 
ly bo | Vicm(m,n)- 


Corollary 34 If €; | Vi», and €, | V, with m 42 n, €; > 3, 2 > 3, and 
gcd(l1 2, Q) = 1, then by bo t Ve for all k > 0. 


Corollary 3851f m > 3, gcd(m,Q) = 1, and 2 { p(m), then o(m) cannot 
exist. 


2.10 Euler’s Criterion for Lucas Sequences 


Euler’s criterion says that if p is an odd prime and a is an integer prime to 
p, then a is a square modulo p if and only if oe =i (mod p). That is, if 
U is the Lucas peaence with parameters P = a +1 and Q = a, then the 
rank py(p) divides po halfway through the maximal potential rank p — 1. 
The next theorem, much as the law of appearance generalized Fermat’s little 
theorem, generalizes the classical Euler criterion. Interestingly, it is also valid 
when €, = —1, where €, was defined in (2.52). In the sequel, (Q |p) denotes 
the Legendre character of @ modulo p. 


Theorem 36 (Euler’s criterion for Lucas sequences) Suppose U is a 
fundamental Lucas sequence with parameters P and Q and p is a prime not 
dividing 2QD. Then 
P | Ur=ep <= (Q|p) =1. 
We begin with two lemmas. 


Lemma 15 If p{ 2QD is prime, then 


Gee = 2Q2" (mod p). 


Proof As p{ D, é& = +1. 
Case 1. €, = 1. Using the subtraction formula (2.13), we see that 


2QV,-1 = VpVi — DU,U; = P? -D =4Q (mod p), 


2.11 Degenerate Lucas Sequences 43 


where the congruence is obtained with the help of Lemma 9. This yields 
Vp—1 = 2 (mod p). (Alternatively, when €, = 1, we may view a (mod p) and 
B (mod p) as belonging to the finite field F,. Thus, V,-1 = a?~! + BP t= 
1+1=2 (mod p).) 

Case 2. ¢, = —1. Using the addition formula (2.11) and Lemma 9, we 
find that 


211 = VpVi + DU, U, = P? -D =4Q (mod p), 


yielding V,41 = 2Q (mod p). (An alternate way of seeing this is that, by 
Remark 9, we have 


Voor = a? th + BPTI = aa? + 6B? =aB+ Ba=2Q (mod p).) 
Lemma 16 If p{ 2QD is prime, then 
(Q|p)=-1l = P | Vo=ep - 


Proof Using identity (2.24) and Lemma 15, we have 


2 
Vo-ep = Vp-p 
2 


=2 


=" (1+@Q°=) (mod p), 


which yields the lemma since Q*°= = (Q|p) (mod p). 


Proof of Theorem 86. By the law of appearance for primes, p | U,_.,. But 
Up—e, = Ur-ep Vo-ep , and p cannot divide both Up- “<p and Vp-ep by Lemma 3. 
v2 ~ 2 2 


Hence, p | Uv-ep if and only if p { Ve-ep, which occurs, since p { Q, if and 
25 =o 
only if (Q|p) = 1, by Lemma 16. 


Remark 21 Here is a direct proof of this criterion using algebraic number 
theory. If €p = Ai then a (mod p) and 8 jmod 7) Delong to F,. Thus, both 
a’ and B*=" are in {+1}. Hence, Q” 7 =o pr = (mod p) iff 
ar = BPE (mod p), which holds, as p { D, iff p | Cis 1. Ife, = —1, 
then the non-trivial automorphism o of Q(a)/Q sends a a 8. As seen in 
Remark 9, Q = aPt! = gP+1 (mod pO). Thus, (Q|p) = 1 iff both a? and 
8°" are in F,. Hence, (Q|p) = 1 iff ar = o(a’z) = BY (mod pO), 
which is true iff p | Ups. 
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Consider the terms of the Lucas sequence U(—1, 1) 
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C0 tit oe 20 teas 


This sequence presents little interest. Moreover, all the terms U3, are zero. 
It is often necessary to remove Lucas sequences having a zero term from 
consideration, particularly when division is involved. 


Example 37 Besides U(—1,1) with Us = 0, the terms of U(2,2) are, with 
k>0, 


0,139.2 0,4. 808 0,16, 39,39, 0,.4 25,0, (—4)* o(—4)* oa", 


and U, = 0. Also the terms of U(3,3) are 


6; 1/3,6;9,9,0,—33, 34, —9 94 3%, 9", ..., 
0,(—1)*33*,(—1)*334+1 (—1)* 338, (—1)*2 7 33kt1 (—1)"37"7?, (—1)*334+2, a. 


with k > 0. Here, the first zero term is Ug. 


Definition 4 (Degenerate Lucas sequence) We say that a Lucas se- 
quence U(P, Q) is degenerate, or that the pair {U, V} is degenerate, whenever 
some U term with positive index is zero, i.e., if there exists an n > 3 such 
that U, = 0. 


We seek simple characterizations for degenerate Lucas sequences and begin 
with an easy lemma. 


Lemma 17 The only roots of unity lying in a quadratic number field have 
order 1, 2, 3, 4, or 6. 


Proof Say ¢ is a primitive root of unity of order n, i.e., the least integer k > 1 
such that ¢* = 1 is n. Then it is well known that the cyclotomic extension 
Q(¢) of Q has degree y(n), where y is the Euler totient function. As ¢ lies in 
a quadratic field, we need to have y(n) < 2. If the prime power p® divides n, 
then p*~!(p—1) | y(n). Thus, the only acceptable prime factors are 2 and 3, 
ie., n = 2°3° witha <2 andb<1,n4 12. 


Theorem 38 The sequence U(P, Q) is degenerate if and only if the following 
equivalent conditions are satisfied. 


(i) The ratio a/B is a root of unity of order 3, 4, or 6. 
(tt) UsUg = 0. 

(iti) Ur =0. 

Gu) P* =O) PP? = 20, or P? =30. 


Proof From (2.6) and Q = a8 4 0, we see that U, 4 0, for all n > 1 in 
case a = 6. If a # 8, then U, = ae . Thus, U, = 0 iff (a/B)" = 1. 
Since a/8 belongs to the quadratic field Q(VD) and is a root of unity, we 
see from Lemma 17 that n is limited to the values 1, 2, 3, 4, or 6. Since 
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U, = 1 and Up = P #0, n must be 3, 4, or 6. All three possibilities occur 
as seen in Example 37. This proves condition (7), but also the equivalent 
condition U3U,Ug = 0. But since U is a divisible sequence, U3 = 0 implies 
Us = 0. Hence, condition (iz) is necessary and sufficient. Similarly, U, = 0 
and Ug = 0 each imply Uj2 = 0. Conversely, since Uj2 cannot be the first 
positive-indexed zero term of a U sequence, U;2 = 0 implies U4, or Ug is zero. 
The equivalence with condition (iv) is obtained by noting that U3 = P? —Q, 
U4 = PP? = 2Q), and Us = U2U3(P? = 3Q). 


Since a number of results are established under both the nondegeneracy 
and the regularity hypotheses, it is convenient to identify explicitly which 
sequences are actually excluded from consideration. 


Corollary 39 There are only two regular degenerate Lucas sequences U(P, Q). 
They correspond to P=+1 and Q=1. 


Proof It suffices to note that in the fourth condition of Theorem 38, the two 
possibilities P? = 2Q, or P? = 3Q, cannot occur when gcd(P,Q) = 1. The 
condition P? = Q and gcd(P,Q) = 1 occurs exactly for P = +1 and Q = 1. 


Remark 22 Alternatively, one could prove Corollary 39 by observing first 
that Q must be +1. Indeed, if some prime p divides Q, then no U term is 
divisible by p. In particular, no U,,, n > 0, is zero. 


Remark 28 Arusingly, one may obtain Lemma 17 with the laws of appear- 
ance and repetition and the Dirichlet theorem on primes in arithmetic pro- 
gressions. Suppose U is degenerate and n is the least t > 1 such that U; = 0. 
Then for all primes p { Q, p(p) | n. Since Ug 4 0 if 0 < d < n, all primes p, up 
to finitely many exceptions, must satisfy p(p) = n. By the law of appearance, 
n|p+41. Hence, all, but finitely many primes, lie in the two arithmetic pro- 
gressions +1 (mod n). Therefore, y(n) < 2, and n must be 3, 4, or 6. By the 
same token, a degenerate sequence cannot have a square discriminant D for 
then all primes, but finitely many, would be 1 (mod n). This can be easily 
verified: Indeed, P? = Q, 2Q, or 3Q with P? — 4Q = A?, respectively, forces 
—3P?, —P?, and —P?/3 to be integral squares. However, invoking Dirichlet’s 
theorem here is like using an elephant to kill a mosquito. 


2.12 Divisibility of U, and V, by Powers of a Special 
Prime 


Let {U,V} be a pair of Lucas sequences with parameters P and Q. For pa 
regular prime, we know the p-adic valuation of all U and V terms: If p | Q, 
the p-adic valuation of all U, and V, with n > 1 is zero by Lemma 2. If 
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pt{ 2Q, then Corollary 11 and Theorem 8 fully describe the p-adic valuation 
of all U;,, while Corollary 21 and Theorem 22 give the valuation of all V,,. If 
p = 2 and Q is odd, then Theorems 13 and 14 provide the 2-adic valuation of 
all U,,, and Theorem 24 informs us about the 2-adic valuation of all V terms. 


We summarize these various theorems that concern regular primes in two 
nearly identical propositions, as they differ only in their form. But one word- 
ing is sometimes more convenient than the other. 


Proposition 40 Suppose p is a regular prime. If p | Q, then v,(Un) = 0 for 
alln > 1. If p{Q, then p admits a finite rank p, where 


a divisor of p—(D|p), if p> 3; 
pis § 2, if 2| P and p = 2; 
3, if P is odd and p= 2. 


Let v be the rank exponent of p { Q. If p > 3, or if p = 2, but either P is 
even, or P is odd and Q = 1 (mod 4), then 


7 = 0, if pt n; 
Un) ‘ +y,(n/p), if p|n. ee 


If P is odd and Q = —1 (mod 4), then 


0, if 34 n; 
v2(Un) = ¢ 1, ifn = 6m +43; (2.60) 
V2(Ug) +12(m), ifn =6m. 


Proposition 41 Suppose p is a regular prime of rank p. Then for n > 0, 
Y¥>(U,) > 0 if and only if p divides n, (2.61) 


with the convention that p = +00, if p| Q. 


Furthermore, if p > 3, or if p = 2, and either P is even, or P is odd with 
Q =1 (mod 4), then for all n > 1, 


Up(Upn) =U + p(n), if pt Q, (2.62) 
where v is the rank exponent of p. If P is odd and Q = —1 (mod 4), then 


1, if 24 n; 


2.63 
v2(P? —3Q)+12(n), if 2|n. oy) 


¥2(U3n) = 


Now we address the problem of determining the p-adic valuation of the 
terms of Lucas sequences when p is a special prime. Thus, we have p* ||P 
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and p? ||Q, where a and b are positive integers. These results appear in the 
paper [2], but the proofs provided here are more concise. 

A lower bound for v,(U,,) is given in the next theorem. It implies that 
v,(U,,) tends to infinity as n > oo, a behavior that sets special primes apart 
from regular ones. 


Theorem 42 Suppose p is a special prime, t.e., p divides gcd(P,Q). Then 


for alln > 1. 


Proof We proceed by induction on n after noting that as U; = 1 and U2 = P, 
we clearly have v,(U;) = 0 > [5] and v,(U2) > 1 = [4]. Suppose n > 3 and 
Vy>(Ux) = [S| for k =n—2 and n-—1. Then 


Vp(Un) 2 min{Yp(PUn—1), %(QUn—2)} 
> 14+ min{yp(Un-1),%(Un—2)} 


ewe 


We will consider two cases: b > 2a and 2a > b. 

Case I. b > 2a. 

The p-adic valuation of U,, will be seen to contain two terms. One term, 
(n — 1)a, gives a steady linear growth in terms of n; the other is the p-adic 
valuation of U/,, where U’ is a Lucas sequence for which p is regular. 


Theorem 43 Suppose p is a special prime with P = p*P’, Q = p’Q’, pt 
P'Q', and b > 2a. Then for alln > 1, 


(n —1)a, if b > 2a; 


rte) = (=D v= | (U),) if b = 2a 


where U'! = U(P', p’-22Q'). 
Proof By (2.30) and writing b = 2a+c, (c > 0), we find that for all n > 1, 
Un = Un(P,Q) = 5_(-1)* (" ‘) aaa 
n n ’ a k 


-_ So(-p (" _ j _ ‘) por eka he pe ee (p°Q’)* 


k>0 
= pew, (P’, p°Q') a p'-)*U, (P’, p°Q’). 


By Lemma 2, no term of U(P’, p°Q’) is divisible by p if c > 1. 
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Case II. 2a > b. 


We start with the p-adic valuation of odd-indexed terms U2n+1. 
Theorem 44 Suppose p is a special prime, P = p*P’, Q = p’Q’ with 2a > b, 
and p{ P’Q’. Then, for all odd n > 1, 

mr ' 
Up(Un) = |e =bm, ifn=2m+1. 


Proof Again, by (2.30), we have 


Vom+1 = So(-1 k 


k=0 


© - *) p2m—2k Ok, (2.64) 


The p-adic valuation of P?”-?*Q* is 2ma — (2a — b)k, which reaches an 
absolute minimum when k = m. As, for k = m, the binomial coefficient 


2m—k\ | 1 
A = 
is not divisible by p, the p-adic valuation of U,, is decided by the last term of 
the sum in (2.64). Hence, we obtain 


Vp(Un,) = 2ma — (2a — b)m = bm. 


We now look at the valuation of Uz,, when 2a > b. We begin with a lemma 
established with the method and notation of Theorem 43. 


Lemma 18 /f 2a = b+c with b > 2 even and c> 0, then 
Un(P, Q) =p" - Un (p/?P', Q’). 
Proof Note that 


-—l- c n—1— brie n—l1— 7 
pr} 2kEQk — pf 1 2k) 3 (p /2 pl) 12k | pbk ih 


Therefore, by (2.31), Un = %(P,Q) = p%—*/?U, (p/?P’, Q’). 


Theorem 45 Suppose p is a special prime, P = p*P’, Q = p’Q’ with 2a > b, 
and p{ P’Q'. Then, for alln > 1, 


Vp(Uan) = bn + (a — 6) + p(n) + hn, (2.65) 


where 
hy = Up(P? — Q')-[2< p< 3]: 2a=b+ 1]: [p| mn). 


2.12 Divisibility by Powers of a Special Prime 49 
Proof By (2.20), 

Usn = Us(P,Q) - Un (Va, Q?) = P - Un(V2, Q?). (2.66) 
Set a! := vp)(V2) = vp(P? — 2Q) and b! := v,(Q?) = 2b. We see that 


b, if p is odd; 
a=b+1, if p= 2 and 2a > b+ 2; (2.67) 
b+1+1%(P2-Q'), ifp=2and2a=b+1. 


Define c’ by 2a’ = b' +c’. Since b’ = 26 is even, we apply Lemma 18 to 
U(V2, Q?) to yield 


Un(Vo, Q?) = p-Y"'/2U, (p°/2V3, Q”?), (2.68) 


where Vi = p~* Vo. 


Case 1. p = 2. Note that, by (2.67), ¢ = 2114+ ’- [2a = b+ 1])) with 
v' = 2o(P? — Q’). Since c > 0, 2 is a regular prime of rank 2 and rank 
exponent c’/2 in U(p"/?2Vj,Q’). Thus, combining (2.66) and (2.68) with 
(2.59), we obtain 


V2(Uan) = ¥2(P) + v2(p—9"/?) + v5(Un(p°/?V3, Q”)) 
=a+(n—1)b+ ((e/2—1) + 12(n)) - [2| n] 
= bn+(a—b)+(- [2a = b+ 1] + 12(n)) - [2 | J. 


That is, 
V2(U2n) = bn + (a — 6) + v9(n) +’ - 2a=b+1]- [2 | nl], 


which corresponds to (2.65). 
Case 2. p > 3. We see that a’ = b'/2 = b so that c' = 0. Hence, by (2.68), 


Vp (Un (V2, Q*)) = (n — 1)b + Yp(Un(p-°V2, Q”)) 
and, by (2.59), we get 
Vp(Un (V2, Q*)) = (n — 1)b + ((v—1) + up(n)) - [o |r], (2.69) 


where p is the rank and v the rank exponent of p in U(p~’V2, Q’”). By (2.66), 
Up_(Uan) = a+ Yp(Un (V2, Q?)), and we conclude that 


Vp(Uon) = bn + (a — 6) + (vp(n) +(v— 1)) -[p| nj. (2.70) 


To determine the values of p and v, we compute the discriminant D’ associ- 
ated with U(p~°V2, Q’”). Thus, 
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D! = p-” ((P? — 2Q)? — 4Q?) = p-*(P* — 4P?Q). 
As v,(P*) = 4a > 2a + b = v,(4P?Q), we see that 
Vp(D') = —2b 4+ (2a + b) = 2a—b > 0. 


Therefore, p(p) = p. Furthermore, if p > 5 or if p = 3 and 2a > b+ 2, 
then vy = 1 by Theorem 12. In those cases, we see that (2.70) yields what 
the theorem claims. It remains to prove that if p = 3 and 2a = b+ 1, then 
vy =1+13(P? — Q’). On the one hand, by (2.69), v3(U3(V2,Q?)) = 2b + v. 
On the other hand, U3(V2, Q?) = Ror = (P? — Q)(P? — 3Q). Thus, 


v3 (U3(V2,Q7)) =b + ((b+ 1) + 13(P? — Q’)) = 2b4+14+73(P? — Q’). 


This yields the value of v. 


We summarize the last three theorems integrating the cases when b = 0 
into our statement and avoiding the use of Iverson symbols. 


Theorem 46 Suppose U = U(P,Q), U' = U(P’,Q’), where P = p*P’, a> 
0, Q=pQ’, b> 0, and pt P'Q. 
If b > 2a, then, for alln > 1, 
Up(Un) = (n — 1)a + vp(U;). 
If 2a > b, then for alln > 1, 


Vp(Uan+1) = bn, 
Vp(Uan) = bn + (a— 6) + p(n) +h, 


where 


0, otherwise. 


An [Os if2<p<3, 2a=b+1 andp|n; 


Now we express the p-adic valuation of V,. By the identity U2, = UnVn, 
we have Yp(Vn) = Vp(Uan) — Yp(Un). Thus, using Theorem 46, we readily 
obtain the following: 


Theorem 47 Suppose V = V(P,Q), V’ = V(P’,Q’), where P = p*P’, a> 
0, Q=pQ’, b> 0, and pt P’Q'. If b > 2a, then, for alln > 1, 


Up(Vn) = na+V,(V,). 
If 2a > b, then, for all positive V indices, 


Up(Van41) =bn+at+ Vp (2n + 1) + k, 
Vp(Van) = bn + £, 


References 51 
where 


he (lias if p= 3, 2a=b4+1 andn=1 (mod 3); 


0, otherwise, 
and 
1+1(P?—-Q'), ifp=2, 2a=b+1 andn=1 (mod 2); 
Le, ifp =2 and2|n or 2a>b+1,; 
0, otherwise. 
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Chapter 3 
Applications 


I suspect at its best your education’s main motive is to fuel your curiosity and teach 
you how to find things for yourself... The educationists seem to think in terms of 
methodical steps but a teacher brimming with passion for the subject is what actually 
works. (Jim Harrison in “Off to the Side”) 


Abstract The purpose of this chapter is to review several diverse applications 
of the Lucas functions, particularly in the area of computational number 
theory. We will begin with a largely historical description of the Mersenne 
primes and then examine how the Lucas functions have been applied to the 
problem of primality testing. As this requires that we be able to compute U, 
and/or V,, for large values of n, we provide in Sect. 3.3 some algorithms by 
which this can be done. We next describe several subjects, such as solving 
congruences, integer factorization, Diophantine equations, and cryptography 
in which the Lucas functions have been applied. 


3.1 Mersenne Primes 


We have already mentioned that a very early application of the Lucas se- 
quences was to the problem of proving that 21?” — 1 is a prime. For reasons 
that we will discuss later in this section, it is now customary to call prime 
numbers of the form 2” — 1 Mersenne primes. In this section, we provide a 
brief discussion of the history and the primality of such numbers; more detail 
and references for this material can be found, for example, in Sects. 1.4 and 
2.1 and Chap. 3 of [77]. 

We say that an integer d is an aliquot or proper divisor of a positive integer 
nif0<d<nand d divides n. For example, 3 is an aliquot divisor of 6, but 
neither 5 nor 6 is. We define the function s(n) of a positive integer n to be 
the sum of all the aliquot divisors of n. Some examples are 
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s(6) = 142436, 

28) =14+2+4+447414 = 28, 

60) =14+24+344+4+5+4+6+410412+15+4 20+ 30 = 108. 


Euclid defined a positive integer n to be perfect if s(n) = n. We see from 
the above that both 6 and 28 are perfect. It is not clear what the origin of 
this concept is, but it is telling that if we have s(n) = n, then the sum of 
the reciprocals of all the divisors of n is 1. As much of Greek mathematics 
at the time of Euclid and earlier was being developed in the Egyptian city of 
Alexandria, it is possible that the idea of numerical perfection arose from the 
ancient Egyptian technique of performing arithmetic using only unit fractions 
(reciprocals of positive integers). 
Euclid proved that if 
m= ea a _ 1), 


where 2” — 1 is a prime, then m is perfect. Much later, Euler, in a paper 
published posthumously in 1849, showed that if m is an even perfect number, 
then it must be of Euclid’s form. Although there has been much activity in 
searching, no odd perfect number is currently known; if one does exist, it 
must exceed 10'°°° (Ochem and Rao [56]). 

Thus, to find the even perfect numbers, it is necessary to find the values of 
n such that 2" —1 is prime. As we mentioned earlier, such numbers are today 
called Mersenne primes after the French Minim friar Marin Mersenne (1588- 
1648). Mersenne was very much involved in the organization of European 
culture during his time; he entertained several important scientific visitors 
at his cell and maintained a voluminous correspondence with many of the 
leading savants of the seventeenth century. For much more information on 
these activities, especially as they affected mathematics, the reader is referred 
to the paper of Grosslight [28]. 


In 1644, Marin Mersenne stated that all the values of n < 257 for which 
2” — 1 is prime are given in the following list: 


n = 2,3,5,7, 13, 17, 19, 31, 67, 127, 257. 


(The correct list is n = 2,3,5,7,13, 17,19, 31,61, 89,107,127, but this was 
not proved until more than two centuries later.) It is because of Mersenne’s 
early interest in numbers of the form M,, = 2” —1 that they are today called 
Mersenne numbers. We remark here that (M,,) (= (Un(3, 2))) is a divisibility 
sequence. It follows that if M,, is a prime, then so must n be a prime. 

The early Greeks knew of the first four numbers in Mersenne’s list and pos- 
sibly more. At the time of this writing, 51 Mersenne primes have been iden- 
tified. All of these can be found at the GIMPS [26] (Great Internet Mersenne 
Prime Search) website. The 51st of these is Mgo5g9933, a number of over 24 
million decimal digits and the largest prime currently known. 
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When Lucas published [41], he was unaware of Mersenne’s list; he learned 
of it a few months later from Genocchi [36] and was more impressed by it 
than he should have been. This was because Mersenne was a correspondent 
of Pierre de Fermat, one of Lucas’ mathematical heroes, and this to Lucas 
meant that Mersenne was likely in possession of unpublished and important 
mathematical information that had been lost. 

We have already stated that Lucas made use of the properties of the Fi- 
bonacci and Lucas numbers to prove that Mj97 is a prime. The two theorems 
that he relied on in [41] were not proved there or correctly in any of his sub- 
sequent papers. It was Carmichael [19] in 1913 who provided correct proofs 
of these results. Nevertheless, it seems that Lucas had the right idea, and in a 
paper (Lucas [42]) that he published in April 1877, but had largely completed 
as early as August of 1876, he gave a reasonably complete (for Lucas) and 
mostly correct account of his ideas for proving Mj27 a prime. He knew that 
(F;,) is a divisibility sequence and that Fo, = Fy Dn, Lon = L? —2(—1)", and 
L? — 5F? = 4(-1)". From the last of these, he knew that ged(F,, Dn) | 2. 
Furthermore, he was aware that for any positive m, there exists the rank of 
appearance p(m) of m in (F),) and that if m | F,, then p(m) | k. He proved 
that if p (# 5) is a prime and we define e, = 1 when p = +1 (mod 5) and 
€» = —1, otherwise, then p(p) | p — ép. 

Now put 7; = Loi and 5; = Fi. We have r1 = Le = 3, rig1 = 7? — 2, and 
r; is odd for i > 1. Also, s5;41 = s;r; and gced(s;,r;) = 1. At this point, Lucas 
once again appeals to a result for which he did not have a correct proof, 
but he was likely aware that this rather general result was not needed for 
his immediate purposes. Let p be any odd prime such that p | r; (note that 
p # 5 because 2 is a quadratic nonresidue of 5). We see that p | 5;,1 and 
because ged(s;,r;) = 1, we have p{ s;. It follows that p(p) | 2’*', p(p) { 2", 
and therefore p(p) = 2't+. Since p(p) | p+ 1, we have 2*t! | p+1, which 
means that p = 2'+!k +1. Thus, Lucas could show that if N = M, = 2-1 
and N | r;, then any prime divisor p of N must be of the form 2’+'k +1 
(> 2*+1 _ 1). Ifn =3 (mod 4), then M, = 2443 -1=8-—1= 2 (mod 5). 
Thus, if N = M,, is a prime, then ey = —1, and we have N | Fiy41. Since 


Fn4i = $n = $n-1l n-1 = Sn—2Tn—-2T n-1 = $n—3ln-3Tn-2Tn-1 


= 117273 °°*Tn—-1; 


Lucas was aware that the values of the r; here must be prime to one another. 
Thus, we can test N for primality simply by finding some i (1 <i < n-—1) 
such that N | r;; such a value of ¢ must exist if N is a prime. A somewhat 
abridged (and corrected) version of Lucas’s test (Lucas [41]) is the following: 


Theorem 48 Let N = M, = 2" —1 with n = 3 (mod 4). If none of the r; 
fori = 1,2,3,...,n—1 is divisible by N, then N is composite. If i is the least 
value of j (1 <7 <n-—1) such that N | r;, then any prime divisor p of N 
must be of the form 2°**1k +1. 
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Notice that if 24 > n, then the least prime which can divide N must 
exceed VN, which means that N must be a prime; thus, if M,, divides rp_1, 
then M,, is a prime. At this point (1876), Lucas was unaware that testing 
the divisibility of rn_1 by Mp is not only sufficient to prove the primality 
of M, = 2" —1 with n = 3 (mod 4), but it is also necessary. Two years 
later, he showed (Lucas [45]) that if p is any prime such that (5|p) = —1 and 
p = —1 (mod 4), then L,,;; = —2 (mod p). Hence, if N = M,, is a prime, 
then since M,, = —1 (mod 4), we have —2 = Ly 41 = Liy44)/2 — 2, and we 
get N | Lyw41)/2 or N | rn-1. Thus, Lucas discovered: 


Theorem 49 If M,, = 2” — 1 with n = 3 (mod 4), then M,, is a prime if 
and only if M,, divides rn_1, where r1 = 3 and ri41 =r? — 2 (mod M,). 


This, of course, is a test for the primality of M, = 2” —1 with n = 3 
(mod 4), but why did he choose to test Mi27? Had he known of Mersenne’s 
list, this would be easier to answer, but all he knew was that 127 = 3 (mod 4) 
and 127 is a prime. There are several other candidates that he might have 
tried, and it remains a mystery as to why he selected Mj97. Possibly, he was 
impressed by the following: 


3 = 2? —1 = Mo,7 = 2? — 1 = Mg, 127 = 2? —1 = My, 


and thought it was natural to try Mj27 as it is the next number in the 
progression.’ In any event, he put r; = 3 and computed rj25 (mod Mj27) 
recursively by using rj41 = 7? — 2 (mod Mjy7). This requires 125 modular 
squaring operations, most of which involve numbers of 39 digits, at the time 
an enormous undertaking. However, by observing the simple congruence 


gmtn =2™ (mod M,), 


he was able to develop an ingenious technique, involving the simple movement 
of pawns on a 127 x 127 chessboard, to compute the value of rj41 = r? — 2 
(mod Mj.7) from the value of r; (mod Mj27). As his overall procedure was 
a somewhat exacting and time-consuming process, he only performed it once 
and at the end discovered that ri2g = 0 (mod Mj 7). 

Lucas realized that he could not use the Fibonacci and Lucas numbers 
to establish the primality of M, = 2” — 1 with n = 1 (mod 4) because in 
this case, M,, = 1 (mod 5) and therefore €,,, = 1 whenever M,, is a prime. 
Perhaps this is what motivated him to investigate the features of the Lucas 
sequences. He quickly discovered that the properties of the Fibonacci and 
Lucas numbers could be extended to (U;,) and (V,,), but it took him several 
months to produce a prime test for such M,,. In Lucas [43], he suggests using 


+ Thus, M127 is a Mersenne prime, but M47 Which has more than 103” digits is currently 
out of reach. 
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P =4 and Q = 1. In this case, D = 12 and (D|M,,) = (3|M,). Since 
Moxy. = 2?*+1 — 1 = 1 (mod 3) and also Mox41 = —1 (mod 4), we get 
(3| Mn) = -—(M,|3) = —1. If we put r; = Voi, then rp = Vi = 4 and 
ri41 = 7? — 2 for i > 0. Thus, we have rp_1 = Viw41)/2 when N = Mp. 
Proceeding in an analogous fashion to the case of n = 3 (mod 4), we see 
that if N | rn_1, then N must be a prime. Notice that because we only 
assumed here that n is odd, this test will also work in the case that n = 3 
(mod 4). This, of course, is half of the celebrated Lucas-Lehmer test for the 
primality of Mersenne numbers. Lucas, however, paid little attention to it, 
mentioning it as little more than an afterthought at the end of his paper, 
but he did cite it in Lucas [46], as this was the test used by Hudelot to 
ascertain the primality of Mg. In [45], he unwittingly proved that this test 
was both necessary and sufficient for any M,, to be prime for any odd n 
(= 2m+1)> 3. In the last result of this paper, he puts P = 2+ and 
Q = —1; then D = 2?™*2 4 4 = 4(2?” +1). If N = Mp, then the Jacobi 
symbol 


éy = (D|N) = (2? 41/2 = 1) = (2? 1 1/279 4-1) 
= (-3| 2?" +1) = (2?” +.1]3) = (2|3) =—-1. 


Thus, by the same reasoning as used above, we see that if N | Vin+1)/2, then 
N must be a prime. Earlier in the paper, Lucas had established that if p is 
an odd prime, then for any P and Q, we have 
QWVp41 = PP*1 4 DPW? = p+ 6D (mod p). 

Thus, if ¢, = —1 = Q, we get Vp41 = —2 (mod p). Since Vp41 = Vo.44)/2 — 
2Q?+)/?. it follows that Vip41)/2 = 0 (mod p) whenever p = 3 (mod 4). 

As a consequence of the above, we see that if N = M,, is a prime, then 
N | Vingiy/2 when P = 2+! and Q = —1. If, as earlier, we define rj = Vo:, 
then ry = Vo = P? — 2Q = 277242 = 4 (mod N), rn_-1 = Vin41)/25 
and ri41 = 7? — 2 for i > 1. Lucas had, without realizing it, proved the 
Lucas-Lehmer test for the primality of M,, for any odd n: 


Theorem 50 Let N = M,, for any odd n > 3, and put r; = 4. If we define 
rit. =r? —2 (mod N) fori>1, then N is a prime if and only if rn_1 = 0 
(mod NV). 


This is the test that has been used to find all the known Mersenne primes 
M,, for n > 127. Indeed, it is remarkable that tests similar to this exist for 
more exotic forms of N. See, for instance, Roettger and Williams [61] for 
examples and further references. 
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3.2 Primality Testing 


Suppose NV is an odd positive integer and we wish to know whether or not 
N isa prime. By Fermat’s little theorem, we know that if N is a prime, then 
for all integers a, we must have 


aN =a (mod N). (3.1) 


Thus, if this Fermat congruence does not hold for any value of a, then N 
cannot be a prime. Unfortunately, for primality proving, it is possible for 
composite integers N to satisfy this congruence for some value of a. For 
example, consider a = 2 and N = 2701 = 37-73, an example known to 
Lucas. Hence, the direct converse of Fermat’s theorem does not hold. Indeed, 
there exists an infinitude of composite numbers, called Carmichael numbers, 
for which the Fermat congruence holds for all values of a. See, for example, 
Sect. 3.3 of Crandall and Pomerance [21] or Sect. 15.1 of [77]. If we want to 
use the Fermat congruence as a prime test, we need only concern ourselves 
with those values of a such that gcd(a,N) = 1. In this case, the Fermat 
congruence becomes 

a’-1=1 (mod N). (3.2) 


Thus, if (3.2) does not hold for any value of a such that gcd(a, N) = 1 and 
1 <a<_N, then N cannot be a prime. Of course, if gcd(a, N) 4 1 and 
1 <a<_N, then it is not necessary to verify (3.2) because N is clearly 
composite. It can also be shown that if N satisfies (3.2) for all a such that 
gcd(a, N) = 1, then N is a Carmichael number. 


Now observe that a Mersenne number is a special case of a number N for 
which we know all of the divisors of N + 1. In order to prove such values of 
N to be prime, Lucas [44, Sect. 27] advocated the use of what he called his 
fundamental theorem. This holds for all regular U sequences. 


Theorem. If N is odd and N | Un_+, where |y| = 1, but N { Ug for all 
aliquot divisors d of N — 7, then N is a prime. 


Notice that, if for some integer a, we put P =a+1 and Q =a, the above 
result can be used to prove the following corollary: 


Corollary. If N is odd and N | aN~-!—1 and N ¢ a4 —1 for all aliquot 
divisors d of N — 1, then N is a prime. 


This result was what Lucas [47, p. 441] considered to be the converse of 
Fermat’s theorem. Later, Lehmer [34] pointed out that this result could be 
made more efficient by replacing the values of d above by (N — 1)/p for each 
distinct prime divisor p of N — 1. A similar observation can also be applied 
to Lucas’ fundamental theorem. 
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Although he made several attempts, Lucas was never able to provide a 
correct proof of his fundamental theorem; the ingredients for providing a cor- 
rect proof were finally given by Carmichael [19]. A somewhat more accessible 
proof can be found in Sect. 4.3. 

The difficulty in using Lucas’ fundamental theorem in primality proving is 
that in almost all cases, the complete factorization of N — ¥y is not available. 
In [44] and in some earlier papers, Lucas used his functions to test numbers 
other than the Mersenne numbers for primality. For example, although he 
did not state this result explicitly, Lucas seemed to be aware of the following 
theorem. 


Theorem 51 Let N = A2”—1, n > 1, where A is odd and A < 2” . If 
gcd(N,Q) =1 and N | Vin4iy/2, then N is a prime. 


Proof Suppose p is any prime divisor of N. Then p | Viv41)/2, and it follows 
that p | Uy41. Thus, p(p) | A2”. Since p does not divide Q, we see, by (2.8), 
that p cannot divide U;y41)/2 and therefore p(p) { A2”-1, We must have 
2” | p(p), which means, by the reasoning used in the proof of Theorem 48 or 
by appealing directly to Theorem 9, that p = k2" +1. If p = 2” — 1, then 
2” = 1 (mod p) and thus N = A—1 (mod p). Since A—1 < 2" — 1, we find 
that A = 1, which means that N = p. Otherwise, p > 2” + 1 > WN, which 
also implies the primality of NV. 


He also discussed the primality of numbers of the form 


Ap” ey, (3.3) 


where y = +1, pis a prime, and p{ A (in Lucas’ tests, p = 2, 3, 5, and 7 is re- 
stricted to be 1 when p = 5). Unfortunately, his tests were given without proof 
and need some corrections and modifications, which were provided recently 
by Roettger et al. [60]. Furthermore, these tests are sufficiency tests only. A 
version of the test of Theorem 51 which is both necessary and sufficient was 
given by Lehmer [36]. 


Theorem 52 Let N = A2” —1, where n > 1, A is odd, and A < 2”. If 
(D|.N) =(Q|N) =—1, then N is a prime if and only if N | Vin41)/2- 


Proof This result follows from the law of appearance (Theorem 9), Theo- 
rem 36, and Theorem 51. 


This result, however, is not effective in that it does not provide an explicit 
recipe for finding the values of P and Q which can be used. However, in the 
case where 3 { AN, it is a simple matter to see that when n > 3, we can always 
use P = 2 and Q = —2, but when 3 | A, the problem of finding suitable values 
for P and Q becomes more difficult; however, in [12], Bosma showed how to 
produce such P and Q as long as A 4 4 — 1. These results were improved 
by Berrizbeitia and Berry [7] to the case of 5 + A. More recently, Deng and 
Huang [22] revisited this problem and dealt with the case of 17 { A. 

We can also approach this problem by using the lemma below. 
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Lemma 19 Let p be a prime such that p= —1 (mod 4). There exist P and 
Q such that (D|p) = (Q|p) = —1 if and only if Q = a2 +b? and P = 2a 
(mod p), where a and b are integers such that (a? + b? |p) = -1. 


Proof Suppose Q = a?+b?, P = 2a (mod p), and (a?+b? |p) = —1. We have 
(Q|p) = —1and pt b. Since D = —4b? (mod p) , we have (D| p) = —1. Next, 
suppose that (D|p) = (Q|p) = —1. We have (4Q — P?|p) = 1; hence, there 
must exist some c such that c? = 4Q—P? (mod p), and as a consequence, we 
have Q = (271 P)?+(271c)? (mod p); the result follows on putting a = 2~!P 
and b = 2~1'c (mod p). 


We can combine Theorem 52 and Lemma 19 to produce the following test: 


Theorem 53 Let N = A2"—1, wheren > 1, A is odd, and A < 2”. Let q 
be a prime such that q=1 (mod 4) and (N|q) = —1. Ifa and b are integers 
such that q = a? + b? and we put P = 2a and Q = q, then N is a prime if 
and only if N | Vin+1)/2- 


If the g in this theorem is small, we can find a and b by trial, but if it is 
large, Brillhart’s [13] modification of Hermite’s algorithm can be used to find 
them very effectively. 

The main ingredient in proving the above tests is the prior determination 
of the forms of possible prime factors of N. Most of these results make use 
of techniques which show that all prime divisors of N must have a certain 
form, but it is sometimes possible to proceed when we know that N has at 
least one prime divisor of a certain form. 


Lemma 20 Suppose that N is given by (3.3), wheren > 1 and A < p”. If 
N has at least one prime divisor r of the form kp" + €, where « € {1,—-1}, 
then N is a prime. 


Proof If N is composite, then N = rT, where r = kp” +6, k > 1, and 
T > 1. Since rT = ¥ (mod p”), we must have T = hp” + ey and h > 1. 
Hence, (kp” + €)(hp” + ey) = Ap” +4; it follows that 


A=hkp” + €(yk +h) = hk(p” —1)+ (k+6)(h+e7)-7. 


Suppose p is odd. In this case, since r and J must be odd, we see that both 
h and k are even. Hence, 


A> Alp’ =1)—1> p”, 


which is impossible by selection of A. 


Next, suppose p = 2. In this case, since A must be odd, we see that h and 
k are of different parity, and therefore hk > 2. Thus, 


A> 22" =1)=1=9""" =3 5.2", 


another impossibility. Hence, N must be a prime. 
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Note that if use is made of the reasoning of Robinson in Theorem 10 of 
[59], it is often possible to refine the above argument to increase the upper 
bound on A. 

We can now use the Lucas sequences to establish that a number N has a 
prime divisor of the form kp” + €, where ¢ € {1,—1}. 


Lemma 21 Let p be a prime and N be a positive integer such that gcd(N, 2pQ) 
=1. If for some positive integer m we have N | Ump and N{U,, there must 
exist a prime divisor of N of the form kp" +e, where e € {1,—1} and p” || mp. 


Proof Since N | Um», and N { Um, there must exist a prime divisor r of 
N such that r*|| N (a > 1) and r* { U,,. It follows that p(r*) | mp and 
p(r*) ¢ m; hence, p” | p(r*). Since, by Corollary 11, we have p(r*) = r*p(r) 
for some k > 0 and p # r, we must have p” | p(r). By Theorem 9, we have 
p(r) |r —e,, where €, = 0,1, or —1, so we see that r = +1 (mod p”). 


By combining the results of Lemmas 20 and 21, we get the following: 


Theorem 54 Let N be of the form (3.3), wheren > 1 and A < p”. If 
gced(N,2Q) =1, N| Un_y, and N { U(n_+)/p, then N is a prime. 


Proof Put m = (N —7)/p. We have p” || mp, and by Lemma 21, there must 
exist a prime divisor of N of the form kp” + €, where ¢ € {1,—1}. Thus, NV 
is a prime by Lemma 20. 


Theorem 51 can be generalized to deal with numbers given by (3.3). 


Theorem 55 Let N be of the form (3.3), wheren > 1 and A < p”. If 
gced(N, 2Q) = 1 and N | Uy_,/U(n_y)/p, then N is a prime. 


Proof In the case that p = 2, the proof is similar to that of Theorem 51. 
Suppose p is odd and s = (p — 1)/2; by (2.38), we have 


Up f Up = pQ” (mod U;), 


where r = (N — y)/p. Thus, by the conditions of the theorem, N { Uy—+)/p 
and N | Uy_—+. The result now follows from Theorem 54. 


Unfortunately, it is rather difficult to convert Theorems 54 or 55 into nec- 
essary and sufficient tests for the primality of N. However, results in Williams 
[74] can be used to prove the following p = 3 analogue of Theorem 53. 


Theorem 56 Let N be of the form (8.3), where p = 3, n>1, and A < 3”. 
Suppose q is a prime congruent to 1 modulo 3 such that NG-Y/3 4 1 (mod q) 
and 4q = t? + 27u?, where t = 1 (mod 3). If gcd(N, qu) = 1, then N is a 
prime if and only if Vam(t,q) = —q'” (mod N), where m = (N — )/3. 
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A somewhat more general version of this result, proved by a different 
technique, was given later by Berrizbeitia and Berry [5]. 

A proof that for any given prime g = 1 (mod 3), values of t and u must 
always exist can be found in Sect. 6 of Chap. 9 of Ireland and Rosen [31] and 
in Chap. 4 of Cox [20]. Also, the value of D here is —27u? and ey = (D| N) = 
(-3|N) = (N|3) = 7. 

Further results for a general odd p are presented at some length in 
Sects. 11.3 and 16.4 of [77], and this matter is examined further in Sect. 7.4. 
In particular, we have a generalization of Theorem 56 in Theorem 11.3.6 of 
[77]. This can be converted into a necessary and sufficient test (Algorithm 
11.3.7 of [77]) for the primality of N in (3.4) as long as some P and Q are 
given such that (D| N) = 7, N ¢{U,(P,Q), and a prime g (= 1 (mod p)) can 
be found such that N(@-))/? #1 (mod q). 

We should point out here that Berrizbeitia et al. [6] have given a very 
general necessary and sufficient algorithm, not requiring Lucas functions, for 
proving primality of a wider class of numbers than those in (3.3), but these 
tests are not in general effective. However, as we have pointed out earlier, 
in certain cases, such as the Mersenne numbers, it is possible to use the 
Lucas functions to produce an effective, necessary, and sufficient test for the 
primality of NV. For example, this is always possible for a generalization of 
the Mersenne numbers: N = (p—1)p"—1, when 2 < p < 10’, but p £3 (see 
Stein and Williams [68]). The generalization N = 2p” — 1 has been discussed 
by Roettger and Williams [62]. 


We have seen that the Lucas functions can be used to produce a test for 
the primality of N when either N + 1 or N —1 has a large prime power 
factor. These are examples of primality proving when a partial factorization 
of N + 1 is available. In 1975, Brillhart, Lehmer, and Selfridge [14] showed 
how to make use of the Lucas functions and partial factorizations of both 
N +1 and N — 1 in order to establish the primality of NV. These techniques 
are also explained in Sects. 4.1 and 4.2 of [21] and in Sects. 12.2 and 12.3 of 
[77] and remain to this day the best methods for primality proving as long as 
enough factors of N+1 are known. Of course, this is usually not the case, and 
many other techniques for primality proving have been advocated since 1975; 
these are very ably described in [21] and from a historical perspective in [77]. 
Since the publication of these books, there have been several further advances. 
In 2002, Agrawal, Kayal, and Saxena announced a deterministic polynomial 
time algorithm for primality proving. Their technique, which does not involve 
Lucas functions, is described in [2] and in the book of Dietzfelbinger [24]. 
See also the insightful paper of Granville [27]. The elliptic curve primality 
proving (ECPP) technique of Atkin and Morain (see Chap. 7 of [21]) has been 
considerably refined by Morain [51], resulting in what is called fastECPP. 
This method has been used by Franke et al. [25] to establish the primality of 
numbers of up to 10000 decimal digits. 


We noted at the beginning of this section that it is possible for a composite 
N to satisfy (3.2) for some a > 1 such that gcd(a, N) = 1. We call such a 
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value of N a base a pseudoprime or a-psp. By defining ey to be the value 
of the Jacobi symbol (D|N), we can extend this idea to the Lucas function 
U,. For a given pair of coprime integers P, Q, we say that N is a Lucas 
pseudoprime if N is composite, gcd(N, QD) = 1, and 


Un—ey (P,Q) =0 (mod N). 


Certainly, by the law of appearance, this congruence must hold when N is a 
prime. 

Suppose, for a given odd N, we define D to be the first element in the 
sequence {5,—7,9,—11,13,...} such that (D|N) = —1. In spite of the offer 
of a cash reward, since 1980, no composite N has ever been found for which 
N is both a 2-psp and a Lucas pseudoprime with P = 1 and Q = (1— D)/4. 
Undoubtedly, such N exist, but they seem to be very rare. This observation 
has led to the inclusion of this Lucas pseudoprime test in NIST’s Federal 
Information Processing Standards Publication 186-4 (Appendix C.3.3) tests 
for the probable primality of integers employed in digital authentication al- 
gorithms. For more information concerning this topic, see Pomerance et al. 
[57], Sect. 3.5 of [21], and Chap. 15 of [77]. 


3.3 Fast Computation of U, and V,, (mod m) 


Let N denote any positive integer. For any integer x, it is well known (see, e.g., 
Algorithm 1.2.2 of [77]) that «™” (mod N) can be computed in O(log m) mod- 
ular multiplication operations (mod N). Now consider this problem with 
respect to K € Q(Vd), where d is an integer and Vd is irrational. We note 
that « = (a+ bVd)/c, where a,b,c are in Z, and that 


cm = Yn) + b/d Um(P, Q), 


where, in this case, P = 2a and Q = a? — db”. If ged(c, N) = 1, then 


K™ = Vn(P,Q)/2+ bVd Um(P,Q) (mod N), 


where P = 2ac~!, Q = (a? — db”)c~? (mod N), and cc~! = 1 (mod N). 
Thus, the Lucas functions can play a role in determining «” (mod NV). In 
many applications of the Lucas functions, such as primality testing, it is often 
necessary to compute rapidly some remote values of V,, and/or U;, (mod N). 
As pointed out in Sect. 4.3 of [77], this can be done by first observing the 
simple identities: 
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Uon = 2Un11U, — PU?Z, 
Vensi = Uns — QUa: (3.4) 
Venta = PUny1 — 2QUnUn4- 


We next define {A,B} = {E,F} (mod N) to mean that A= E and B=F 
(mod NV). Let (bo, 61, b2,..., bg )2 be the binary representation of m. Suppose 
P; = {A, B}; when bj41 = 0, put Pi41 = {2AB— PA”, B?-QA?} (mod N); 
when bj41 = 1, put Pi41 = {B? — QA”, PB? — 2QAB} (mod N). If we put 
Po = {1, P}, we see by the identities (3.4) that P, = {Um,Um+4i}. We can 
find V,, (mod NV) by using the identity V,, = 2Um+s1 — PUm. 

The above technique can be somewhat less efficient than the following 
method, which can be used when gcd(PQD,N) = 1, a case that occurs 
frequently. We define 


Wr =Q°-"Von (mod N), (3.5) 


where QQ-! = 1 (mod N) and Q-" = (Q7?)”. Also, for any integers s and 
t, we have Wo, = W2 — 2 (mod N) by (2.24) and Wet = WW: — Wet 
(mod NV) by (2.17). We also have the following results, all modulo N: 
Von = Q'Wa, 
PVonsi = Q?*(War + Wrst), 
DUons1 = Q"** (Wisi — Wr), 
PDUon, = Q"(2QWh41 — (P? — 2Q)Wr). 


(3.6) 


Thus, if h = |m/2|, we can compute V,,, and U,, modulo N if we know 
W,, and Wn+1 (mod N). Now suppose that (bo, b1, b2,...,b¢)2 is the binary 
representation of h, and put Po = {Wi,W2}, where W; = P?Q-!— 2 
(mod N) and W2 = W? — 2 (mod N). Suppose P; = {A, B}; if bi41 = 0, 
put Pitt = {A 2,AB Wi}; if bia = 1, put Pi4t = {AB = W,, B? = 2} 
(mod N). We see that Py = {W),, Writ}. 

Furthermore, given values for W;,, and Wr+41 (mod N) and some posi- 
tive integer t, we can quickly evaluate W;, (mod N). If (bo, bi, b2,..., de )2 
is the binary representation of t and Po = {Wn, Wr+i}, we compute Py = 
{Win, Win4iy} in a similar fashion to the above scheme. For P; = {A, B}, 
we calculate Pj41 = {A?—2, AB—W,} (mod N) when bj41 = 0; if 41 = 1, 
compute P41 = {AB — W;, B? — 2} (mod N). 

We next define the polynomial G,,(x) for m > —1 by G_i(x) = —1, 
Go(x) = 1, and Gyii(x) = «G;(x) — Gy_i(x) (k = 0,1,2,...). This poly- 
nomial is related to the w,(x) polynomial of Sylvester [71, p. 366ff]; more 
information can be found concerning it in Sect. 4.2. It is easy to establish by 
induction that 


(yt — 1) /(y—1) =y"Gn(y ty" )(m > -1). 
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Note that this holds even when y = 1 because G,,(2) = 2m +1. We also 
have G,,(—2) = (—1)™ for any m and G,,(—1) = 1 whenever 3 | m. If we 
put r = 2s+1 and y = —(a/8)", we get 


Var = (-1)°Q™G,(—Von/Q”)V, = (-1)°Q™G;(—W,)V, (mod , ' 
3.7 
if we put y = (a/G)”, we get 


Unr = Q”™°G.(Van/Q”")Un, = Q™*G(Wr)Un (mod N). (3.8) 


If we refer back to Theorem 55 and put m = (N — y)/r, we see by (3.8) that 
N | Un—-y/U(n—y)/r if and only if N | G,(W,). In the case of p odd, we can 
now restate Theorem 55. 


Theorem 57 Let N be of the form (3.3), wheren > 1 and A < p”. If 
gcd(N, 2Q) = 1, m=(N—Y)/p, and N | G,(W,,), where s = (p—1)/2, then 


N is a prime. 


3.4 Solving Quadratic and Cubic Congruences 


Let f(a) be any polynomial with integer coefficients, and let m be any positive 
integer. There is a simple algorithm for solving 


f(z) =0 (mod m); 


simply try all possible values of x in the range 0,1,2,...,m-—1, and record 
those values which satisfy the congruence. However, this procedure, while 
perfectly valid, is very inefficient when m is large. More effective techniques 
can be found in any elementary number theory textbook, but these rely 
upon the ease with which m can be factored (see Sect. 3.5) and the available 
routines for solving 

f(x) =0 (mod p), 


where p is a prime divisor of m. The general problem of solving a polynomial 
congruence modulo a prime has attracted a considerable body of literature. 
See, for example, paragraph 4.3 of Lidl and Niederreiter [40]. In this section, 
we will confine our discussion of this problem to that of solving polynomial 
congruences of degrees 2 and 3. 
We first describe how a Lucas function can be employed to solve the con- 
gruence 
z*=a (mod p), (3.9) 
where p is an odd prime and a is a quadratic residue of p, i.e., the Legendre 
symbol (a|p) = 1. See [75, p. 228]. Put Q = a, and find by trial a value of 
P such that (D|p) = —1, where D = P? — 4Q. This latter process might be 
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considered a defect in the method, as it is unknown how many possible values 
of P we would have to try until we find one such that (D|p) = —1, but in 
practice, a suitable value for P appears after only a few trials. By Theorem 36, 
we must have p | U(p+1)/2. Thus, by (2.8), we get Vii iyjy = AQietl)/2 
(mod p). Since Q&+)/? = aa-Y/? = a (mod p), we see that the roots of 
(3.9) are given by +27'V(,41)/2 (mod p). 


The above idea was first suggested by Cipolla in 1903 and rediscovered by 
Lehmer in 1969. 

We note that simpler solutions to (3.9) exist for p = 4n+3 and p = 8n+5. 
If p = 4n+3, then 


p-1 
a2"? — aq?"t1 — gat =a (mod p). 


Thus, ta"t! (mod p) are the solutions of (3.9). 
Legendre noticed that if p= 8n +5, we have 


Bae a qint2 


1=(a|p)=a? (mod p); 


2n4+1 — 4 


thus, a +1 (mod p). In the case of a?”*! = 1 (mod p), the solutions 
of (3.9) are clearly ta"*! (mod p); in the case of a?”*! = —1 (mod p), the 
solutions are +(1/2)(4a)"*! because 


(+ (1/2)(4a)"t1)* = 42rtig2nt? — 94n+2q2nt1q = (2 | p)a=a (mod p). 


This observation of Legendre was extended to p = 8n +1 by Tonelli in 1891. 
Tonelli’s technique was rediscovered and generalized by Shanks in 1972. 

We illustrate the Cipolla-Lehmer method for a prime of the form 8n + 1, 
namely, 89, and a = 2. Thus, Q = 2 and D = P? — 8. Trying P = 1 yields 
D=-—7 and 


(—7 | 89) = (89| 7) = (5| 7) = (2| 5) = —1. 


We need to compute V45. Using a variation on what we did in the previous 
section, i.e., the doubling and side-step formulas Uz, = UnVn, Van = V2? — 
2Q”, Uonsi = $(PU2n + Von), and Vany1 = $(DU 2), + PVo,), we obtain 
Von = V2 _ oe, VUon+1 = $(U2n + Von), Von41 = $ (Von _— Wan), and 


Vn 
i 
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so that the solutions are 


39 39-89 
2 2 


= +25 (mod 89). 


(We see that 2 is a fourth power residue modulo 89.) 


For more information and references concerning the solution of the 
quadratic congruence, see Miiller [53]. 


We now turn to the problem of solving the cubic congruence 


f(z) =2° + az? +b2+c=0 (mod p), (3.10) 


where a, b, and c are integers and p (> 3) is a prime. It may seem surprising, 
but by using an old idea of Cailler [18], Lucas functions can be used in many 
cases to solve (3.10). It is well known that if we put y = a+ 32, then 27f(a) = 
g(y), where g(y) = y® — 3Ay+ G, A = a? — 3b, and G = 2a? — 9ab + 27c. 
Thus, any solution y of 


gly) =y? —3Ay+G=0 (mod p) (3.11) 


can be readily converted to a solution x of (3.10). 


If p| A, we are left with the problem of solving 

y>=-G (modp). (3.12) 
If p = 2 (mod 3), then (3.12) has the solution y = —G??-/3 (mod p). If 
p = 1 (mod 3), a necessary condition for (3.12) to have a solution is that 
(—G)@-))/3 — G@-D/3 = 1 (mod p). Put t = (p— 1)/3, and suppose that 
3{t (this certainly occurs when 9 { p—1). In this case, there must exist some 
c€ {1,2} such that 3 | ct+1, and it is easy to see that (3.12) has the solution 
y = —G(+)/3 (mod p). In what follows, we will extend these ideas to the 
case where p{ A. 


We first observe that if p | G? — 4A%, then it is easy to see that y = 
G(2A)~! (mod p) is a solution of g(y) = 0 (mod p); thus, we will assume 
that p { G? — 4A’. It is convenient at this point to make use of the properties 
of finite fields; let F, be the finite field containing p elements, and let K 
containing F,, be the finite field containing p* elements. We may suppose 
that A and G? — 4A® are both nonzero in K. Also, there must exist a and 6 
in K such that a + 6 = G/A and a8 = A. We have A?(a — 8)? = G? — 4A3, 
which means that a # 6. Now let z = (y—a)/(y—) #1, where y € F, and 
a(y) = 0 (mod p); we have y = (Bz — a) /(z — 1) and 9(y) = A(z)/(2 ~ 1), 
where h(z) = B(a — 8)?z3 — a(a — 8)?. It follows that 8z3 = a in K. Put 
D = (a— B)? and e, = (D|p). Since, in K, a? = a when €, = 1 and a? = 8 
when €, = —1, we see that z?~“ =1 in K. 

Suppose that 3 { p — €,; then, as above, there must exist c € {1,2} 
such that 3 | c(p — ep) + 1. Since z* = a/8, we must have z = (a/B)”, 
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where m = (c(p — €,) + 1)/3. Since y = (Bz — a)/(z — 1), we see that y = 
QUm-1(P, Q)Um(P, Q)~* (mod p) is a solution of (3.11) , where P= GA~1 
(mod p) and Q = A. 

Next, suppose that 3 | p — €. In this case, since z?- © = 1, we must have 
(a/B)-©)/3 = 1, which is equivalent to U(p—c,)/3(P, Q) = 0 (mod p). If this 
latter congruence does not hold, then (3.11) has no solution. If it does hold, 
then put t = (p—e,)/3, and suppose that 3 { t; there must exist c € {1,2} such 
that 3| ct+1 and z = (a/8)™ satisfies 23 = a/8 when m = (ct + 1)/3; thus, 
as we reasoned earlier, y= QUim—1(P, Q)Um(P,Q)~' (mod p) is a solution 
of (3.11). Hence, we can use the Lucas functions to solve (3.11) as long as 
91 p—&. 

By using a method of Williams [76], we can also use Lucas functions to 
solve (3.11) without needing to compute U,,(P,Q)~! (mod p). In order to 
do this, we need a preliminary lemma. As earlier, we assume that P = GA7! 
(mod p), Q= A, D= P? — 4Q, & = (D|p) £0, 1 = (Q|p), and ce {1,2}. 


Lemma 22 [ft is any divisor of p—€, such that r = (p—,)/t is odd, then 
Uce = 0 (mod p) <= > Vee = 2n°Q*t/? (mod p). 


Proof We first observe that since r is odd, we must have t even. By (2.8), it 
is clear that p | Uc, when Vo, = 2n°Q°/? (mod p). If p | Uct, we have p | U; 
or, possibly in the case of c = 2 and 7 = —1, p| VY; by Theorem 36. In this 
latter case, Vo, = —2Q* (mod p). By (3.8) and the law of appearance, we get 


0=U,-., = Q™G5(Vor/Q")U; = (—1)°Q*SU; (mod p), 


where s = (r—1)/2. Hence, we get U; = V; = 0 (mod p), which is impossible 
by (2.8); thus, we can only have p | U; and by (2.8) V; = 26Q*/? (mod p), 
where 6 = +1. It follows that Vo, = V,?2 — 2Qt = 2Q* (mod p). By (3.7), we 
have 


QP)? = Vp eg = Vee = (1)°Q*Gs(—Vae/Q*)Vi = 20Q"/?. 


Hence, we must have 6 = 7, and we have the lemma. 


We are now able to prove the following theorem. 


Theorem 58 Let p, €,, t, and r be defined as in Lemma 22, and suppose 
that 3 { t. Let c € {1,2} be such that 3 | ct +1. If p | Us, then y = 
—n°P~!Q(Wr41+W.) (mod p) is a solution of (3.11), where k = (ct — 2)/6 
and W;, was defined in (3.5). 


Proof By Lemma 22, we must have V., = 2n°Q“/? (mod p). By (2.11), we 
have 
Weer = VeVi + DU ei = 2n°Q*/?P (mod p). 


Hence, if m = (ct + 1)/3, we get V3m = °PQ“/? (mod p) and 
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V3 — 3Q"Vin = Vim = n°PQ*/? (mod p). 

Since m is odd, we have m = 2k + 1 and k = (ct — 2)/6. We find that 
(VinQ-")? — 3Q(VinQ-") = n°PQ (mod p) 


or 
(—1°VinQ-*)® — 3A(—1°VnQ7*) + G=0 (mod p). 


By the second formula in (3.6), we see that y = —n°P~!Q(We4i + We) 
(mod p) is a solution of (3.11). 


Thus, if p—e, #0 (mod 9), we can always find a solution of (3.10) by using 
Lucas functions. More results, particularly for the case that 9 | p — €,, and 
additional references concerning solving the cubic congruence can be found 
in Miller [54]. 


Example 59 Find a solution of 
y> —2ly—7=0 (mod 41). (3.13) 


In this case, we have p = 41, P = —1, Q =7, and D = —27 (mod 41). Thus, 
n = (7|41) =—1, e) = (-3|41) = —1, and 41 — e) = 42. Since 3 | 42 and 
9 { 42, we can put t = 42/3 = 14 and c = 1 so that m = (ct +1)/3 = 5 and 
k = 2. We have Up = 0, U; = 1, Uz = —1, U3 = —6, Us = 18, and Us = 29. 
Hence, y = 7-13-29-! = 7-13-17 = 30 (mod 41) is a solution of (3.13). By 
results in Sect. 3.3, we also have W, = P?Q-!-2=7-!—2=4 (mod 41), 
W2 = W? —2=14 (mod 41), and W3 = W,W2 — W; = 11 (mod 41). Thus, 
y = —n°P-1Q(We4i + We) (mod 41) = —1-7(14 + 11) = 30 is, as we have 
already seen, a solution of (3.13). 


3.5 Integer Factoring 


The integer factoring problem can be stated as follows: Given an integer 
N > 1, where N is not a prime, find an integer a with 1 < a < N such that 
a is a divisor of N. This rather simple sounding request has turned out to 
be an enormous challenge to mathematicians since antiquity, but within the 
last 50 years, it has become of great importance to the problem of providing 
secure communication. In this section, we will only discuss the application 
of the Lucas functions to this problem, but there is much more to integer 
factorization than what we will discuss here. For further information, the 
reader is referred, for the early history of the problem, to Williams and Shallit 
[79] or various chapters of [77] and for modern techniques to [21], Lenstra [38], 
and Wagstaff [72]. 
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In 1974, Pollard [58] produced a rather novel approach to this problem. He 
assumed that N has a prime divisor p, where the largest prime power divisor 
of p—1 is less than or equal to a preassigned bound B. Let P denote the set 
of all primes, and for each q € P, define the nonnegative integer (q) by 


gO < B< pt, 


Let R be the product of all g* for q € P; this is a finite product because 
A(q) = 0 when g > B. Also, p—1 must be a divisor of R by our assumption. 
Since a?~' = 1 (mod p) for any a such that ged(a, N) = 1, we must have 
a® =1 (mod p), and we find that p must be a divisor of ged(a® — 1, N). For 
any a such that gcd(a, N) = 1, it is relatively easy to compute a® (mod N); 
thus, we can use the above observation to find a factor ged(a® — 1, N) of N, 
as long as our initial assumption holds. Pollard also discussed a very useful 
second stage of his algorithm to be used when p—1 has a single prime divisor 
r, where r < B’, a larger bound than B, the upper bound on all the other 
prime power divisors of p— 1. It was by making use of Pollard’s idea with 
B=6-10° and B’ = 10" that Baillie (unpublished) discovered a new prime 
divisor of M257 in 1980. (See Sect. IITB2 of Brillhart et al. [15].) 


Guy [29] mentioned a possible technique, using Lucas functions, for factor- 
ing N when p+1 has only small prime power factors. This was implemented 
later by Williams [75]. In this algorithm, we assume that the largest prime 
power divisor of p+ 1, where p is some prime divisor of N, is less or equal 
to a preassigned bound B and, as above, let R be the product of all gq for 
q € P. In this case, we see that p+1 must be a divisor of R. If ged(Q, N) = 1 
and (D|p) = —1, then p | Up4i(P,Q) by the law of appearance. Hence, as 
above, we find that p | Ur(P,Q) and therefore p | gcd(UR(P, Q), N). As it 
is more convenient to compute Wr(P,Q) instead of Ur(P,Q), we observe 
that if p | Ur(P,Q), then V2(P,Q) = 4Q® (mod p) by (2.8) and therefore 
Wr(P,Q) = 2 (mod p) by (2.25). Thus, to find a non-trivial factor of N, 
we make use of the techniques described in Sect.3.3 to calculate Wr(P, Q) 
(mod NV) and then evaluate gcd(Wpr(P, Q) — 2, N). Just as in the case of Pol- 
lard’s p—1 method, there is a second stage of the p+1 method. This can be 
used when p+ 1 has a single prime divisor r, where r < B’ and B’ is a larger 
bound than B. These methods were improved later by Montgomery [50]. 


Bach and Shallit [3] have extended the p—1 and p+1 methods to produce 
a factoring technique which operates under the assumption that ®,(p) has 
only small prime power divisors. Here, ®,(a) denotes the kth cyclotomic 
polynomial, an irreducible (over the rational numbers) polynomial, of degree 
y(k), divisor of x* — 1. For example, 21(p) = p— 1, ®2(p) = p+ 1, ©3(p) = 
p?+pt+1, &4(p) = p*? +1, etc. See Sect. 4.3 for more information concerning 
the cyclotomic polynomials. 
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3.6 Diophantine Equations 


A Diophantine equation is an indeterminate equation whose unknowns are 
constrained to assume integral (or rational) values. For example, an integral 
solution of the Diophantine equation x? + y? = 2? is (3,4,5). The study 
of these equations began in antiquity and continues to be an active area of 
research to this day. While it may appear that these objects are of little 
importance, they have nevertheless inspired a great deal of very fundamental 
mathematics; the search for all of the solutions of the Fermat Diophantine 
equation x” + y” = 2” (n > 2) was responsible for the development of very 
sophisticated results involving algebraic geometry, and attempts to solve the 
equations like 2? — 2y? = n (for a given n) stimulated much research on 
rational approximations to irrational numbers. For a compendium of results 
on this topic up to 1920, the reader is referred to the second volume of Dickson 
[23]; for more modern results and techniques, see Mordell [52], Schmidt [65], 
Shorey and Tijdeman [63], Smart [66], and Steuding [69]. 
One of the most studied Diophantine equations is the so-called Pell equa- 
tion. This is the equation 
x? — dy’ = 1, (3.14) 


where d is a given non-square positive integer. It is important to realize 
that non-trivial (y 4 0) solutions always exist for (3.14), although it is not 
immediately clear that this is so. For example, +? — 19y? = 1 has the solution 
(170, 39), but the smallest value of |y| such that 2? — 1621y? = 1 is a number 
of 74 decimal digits. The Pell equation often turns up in the investigation 
of other Diophantine equations such as x? + y? + z? = 1 (Lehmer [37]) or 
(2? +.a)(y? +a) = (az? +67)? (Williams [73]). See also Chap. 8 of [52]. A very 
readable account of the Pell equation can be found in Barbeau [4]. 

It is well known that all solutions of (3.14) are given by (Xn, Yn), where 
X,, and Y,, are generated from a fundamental solution (X1, Y1) by 


Xn +VdYp = (X1+VdY,)" 


for all integral values of n. The problem of how to compute this fundamental 
solution is discussed in some detail in Jacobson and Williams [32]. If, as noted 
by Lehmer [35], we put P = 2X, and Q = 1, we find that D = 4dY? and 


2Xn = Vi(P,Q), Yn =ViUn(P,Q). 


Thus, we can determine the properties of the solutions of (3.14) from the 
properties of the Lucas functions. Also, wherever it is possible to use the Pell 
equation to solve a given Diophantine equation, we will very likely need to 
appeal to the properties of the Lucas functions. As a simple example, suppose 
we wish to find a solution of the Pell equation 


x? —931y? = 1. (3.15) 
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We observe that 931 = 7? - 19; since the minimal solution of x? — 931ly? = 1 
is (170,39), we need to find a value of n such that 7 | U,,(340,1). Since 
€7 = (D|7) = (19|7) = —1 and (Q|7) = 1, we see by the law of appearance 
and Theorem 36 that n = (7 — €7)/2 = 4 will work and x = V4(340,1)/2 = 
6681448801 and y = 39U4(340, 1)/7 = 218975640 are solutions of (3.15). The 
theory of the Pell equation has attracted a great deal of literature, much 
of which is cited in [32] and Lenstra [39], to which the interested reader is 
referred. 

The Lucas sequences have also appeared as the subject of Diophantine 
analysis; for example, consider the problem of finding all the Fibonacci num- 
bers that are perfect integral squares. Credit is usually given to J. H. E. 
Cohn for first proving in 1964 that F, = Fy = 1 and Fig = 144 are the 
only square Fibonacci numbers. However, Wilhelm Ljunggren had solved the 
problem more than a decade earlier, using much more sophisticated and gen- 
eral methods. Unfortunately, although he published his papers in English, 
they appeared in Norwegian venues during and around World War IT and 
were not as well known as they should have been. More recently, the prob- 
lem of finding all F,, (and L,,) which are perfect integral powers has been 
solved by Bugeaud, Mignotte, and Siksek [17] using very deep and powerful 
techniques. (The only additional numbers are Fy = 8, L, = 1, and Ls = 4.) 
The paper [17] also has extensive bibliography which we recommend to the 
interested reader. 


3.7 Cryptography 


Cryptography is the practice of designing and analyzing techniques that as- 
sure secure communication in the presence of malicious adversaries. In the 
last 50 years, cryptography has been the object of considerable (unclassified) 
research by mathematicians, computer scientists, and electrical engineers. For 
an overview of this subject, the reader is advised to consult any number of 
introductory textbooks; in particular, we suggest the books of Buchmann 
[16] and Stinson [70]. Also, for a more detailed treatment, the comprehensive 
handbook of Menezes et al. [49] is highly recommended. As both [70] and 
[49] contain extensive bibliographies, we will refrain from including a lot of 
references in this section. 


Since the inception of the Internet, it has become customary to make use 
of public-key (or two-key) cryptography for securing Internet commerce. In 
such a scheme, each member of a group of individuals wishing to exchange 
information will have both a private key and a public key unique to that 
person. If Alice and Bob are members of such a group and Alice wishes to 
communicate with Bob, she looks up his encryption key in a public direc- 
tory and encrypts her message M to him using this key. On receipt of this 
ciphertext, Bob uses his decryption key to decipher it and produce M. As 
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an example of such a cryptosystem, consider the so-called RSA system. Each 
member of the group, say Bob (or a trusted authority acting on Bob’s be- 
half), selects two large primes p and q of k digits at random, keeps them 
secret, and computes N = pq. He also selects at random an integer e (< N) 
such that gcd(e, p(N)) = 1 (y(N) = (p — 1)(¢q — 1)) and solves the linear 
congruence 

ed=1 (mod y(N)) 


by the extended Euclidean algorithm to find d with 0 < d < y(N). Bob’s 
public encryption key is the pair (e, N), and his private decryption key is d. 
If Alice wishes to send a secure, numerically encoded message M (0 < M < 
N,gcd(M, N) = 1) to Bob, she computes C = M® (mod N) (0<C < N) 
and sends C’ to Bob. Bob can recover M from C’ by calculating 


CSM eM aM God 1), 


by Euler’s theorem. Since M < N, it is uniquely determined. This scheme has 
been the subject of many cryptographic attacks, but with some modifications, 
it has withstood them all and is still widely used today. See Boneh [10]. 


Of course, if an adversary can factor the RSA modulus N, then he can 
break the system. However, in spite of the many improvements to integer 
factoring algorithms since the announcement of the RSA system in 1977, fac- 
toring N when k = 1024, say, seems still to lie in the distant future. Of course, 
this statement could become invalid, should some group of clever individuals 
develop a new and better factoring algorithm or produce a universal quan- 
tum computer with a sufficient number of qubits. At this point, the latter 
scenario seems more likely. For information concerning quantum computers 
and computing, see Kaye et al. [33]. 


One of the problems associated with the RSA cryptosystem is the process 
of selecting p and qg. We note that because of the existence of the p— 1 and 
p+1 factoring techniques, it is essential for the security of the system that 
each of the four numbers p+ 1 and q+ 1 has at least one large prime factor. 
The problem of producing such p and q was first examined by Williams and 
Schmid [78], where the use of the primality tests of [14] was advocated. Since 
then, there have been further developments by Shawe-Taylor [64] and Maurer 
[48]. This latter paper is a particularly valuable contribution to this problem. 


In 1993, Smith [67] produced a public-key cryptosystem, called LUC, 
which was based on Lucas functions. “The basic idea behind LUC is that 
of providing an alternative to RSA by substituting the calculation of Lucas 
functions for that of exponentiation. While Lucas functions are somewhat 
more complex mathematically than exponentiation, they produce superior ci- 
phers.” Nevertheless, the system has some important weaknesses as pointed 
out by Bleichenbacher et al. [8]. However, it has since been recommended 
as a possible authentication system and continues to be the object of ac- 
tive research. (See, e.g., Ibrahimpasi¢ [30].) If we suppose that N is an RSA 
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modulus and gcd(QD,N) = 1, the basic idea behind LUC is the simple 
result, easily proved from the law of appearance, that Ujyiwy)(P,Q) = 0 
(mod N) and Vinyw) (P,Q) = 2 (mod N), where m is any positive integer 
and 7(N) = (p? — 1)(q? — 1). In this scheme, Bob computes 7)(V) and finds 
some positive e < N at random such that gcd(e, ~(NV)) = 1. As in RSA, the 
pair (e, NV) will constitute his public key. He next solves the linear congruence 


ed=1 (mod w(N)) 


for his private key d. (This is not exactly what is recommended in [67], but 
as mentioned in [8], it avoids the problem of message dependence.) 

For Alice to send a message M (< NV) to Bob, she first puts P = M and 
Q = 1 and computes C = V.(M,1) (mod N) and sends C' to Bob. Bob can 
recover M by computing Va(C,1) = M (mod N). To see why this works, we 
observe that 


Va(C, 1) = Va(Ve(M, 1), 1) = Vea(M, 1) = Vigewony(M, 1) =M_ (mod N) 
by (2.21) and (2.11). The latter congruence follows from 


2Vi 446 (ny(M, 1) = Vi(M, 1) Vinny (M, 1) + DUL(M, 1) U y(n) (M, 1) 
=2M (mod N). 


The values of C and Va(C, 1) can be computed quickly by using the techniques 
mentioned in Sect. 3.3. 

We have mentioned that if an adversary can factor N, then the RSA 
scheme (and LUC) can be broken. This leaves the question of whether break- 
ing RSA is equivalent in difficulty to factoring N. Boneh and Venkatesan [9] 
have provided evidence which suggests that this is not the case, and Boneh 
and Durfee [11] have shown that if d < N°, then N can be effectively 
factored. However, Aggarwal and Maurer [1] have proved that breaking RSA 
is equivalent in difficulty to factoring the modulus under a generic model of 
computation, but this is a very restrictive model, as it does not exploit the bit 
representation of elements except for testing equality. There is, nevertheless, 
a scheme somewhat like LUC for which it can be proved that breaking it 
is equivalent in difficulty to factoring N. This system, which is described in 
Sect. 14.2 of [32], makes use of the solutions of a certain Pell equation, which 
we have seen are essentially given by the Lucas functions. Further information 
on this system can be found in Miiller [55]. 
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Chapter 4 


Some Further Properties of the Lucas 
Sequences 


I have to change to stay the same. (Willem de Kooning) 
Knowledge comes from experience, all the rest is information. (Albert Einstein) 


Abstract This chapter is a compilation of some miscellaneous results involv- 
ing the Lucas sequences. We begin with a discussion of the relationship of the 
Lucas sequences to the sine and cosine functions and then describe the period- 
icity properties of U and V modulo an integer m. This is followed by a section 
devoted to the relationship of the Lucas sequences to the Chebyshev and the 
Dickson polynomials. We then review some additional arithmetic properties 
of the Lucas sequences that are not covered in Chap. 2 and establish what Lu- 
cas called his fundamental theorem. Subsequently, we provide some insights 
on the factorization of the terms of U. Next, we describe Lehmer’s extension 
of Lucas’ sequences and examine some of their properties. The following sec- 
tion makes use of our earlier factorization results to prove an old theorem of 
Carmichael on primitive divisors of (U,,) and of the Lehmer sequences. We 
conclude with a brief treatment of the topic of when the Lucas sequence U 
can contain a complete set of residues modulo an integer m. 


4.1 Connection with the Circular Functions 


Lucas was particularly struck by the similarity of his functions to the sine 
and cosine functions; in fact, in Sect. II of [16], he proved that if i is used to 
denote a root of #7 +1, then 


Un = (2Q"/?/V—D) sin[(ni/2) log(a/8)] 
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and 
Vn = 2Q”? cos|(ni/2) log(a/8)]. 
He did this by putting z = nlog(a/G)/2 in the well-known analytic formulas 


sin(iz) = —(e* —e-*)/2i and cos(iz) = (e* +e *)/2. 


As both sine and cosine are singly periodic functions with period 27, Lucas 
(Sect. XXVI of [16]) seemed to regard U,, and V,, as simply periodic numerical 
functions, where for any particular modulus m, the (numerical) period in this 
case is the least positive integer 7 = 7(m) such that both 


Unyr =U, (modm) and Vzi,=V, (mod m) 


hold. This explains the title of [16]. He considered the above formulas as the 
analogues of those conveying the periodicity of the circular functions: 


sin(e +27) =sing and cos(x+ 27) = cosa. 


Although Lucas was aware of the numerical periodicity of his functions mod- 
ulo m, he really only proved his above numerical periodicity results when m 
is a prime. We will now show by a very simple process that they hold for 
any m such that gcd(m,Q) = 1. Let (X,,) be any sequence satisfying the 
recursion relation (2.1), i-e., 


Xn42 = PXn4+1 = QXn for n Zz 0. 


If we put T; = (Xi, Xi41), (i 0, cy2 = -Q, c21 = 1, and c22 = P, then 
Ti41 = T;C, where C = (cj;)2x2. Note that C is the companion matrix of 
x*— Px+Q and the determinant of C, or det(C), is Q. It follows by induction 
that JT, = ToC”. We now assume that m is a fixed positive integer. Since 
there are only a finite number of residues of the integers modulo m, by the 
Dirichlet pigeonhole principle, there must exist some r (> 0) and s (> 0) 
such that C’t* = C* (mod m); hence, T,+; = Ts; (mod m). Notice that the 
values of r and s here are independent of the initial vector To. When det(C) 
is relatively prime to m, we see that C' is invertible modulo m and therefore 
C, =I (mod m), where I is the 2 x 2 identity matrix; in this case, we may 
put s = 0. Since for any 7 > 0, the vector T;4; (mod m) depends only on the 
vector T; (mod m), we must have 


Tntr=T, (modm)  foralln>s, 


and therefore, 
Xnir=Xpn (modm) foralln>s. 


We call the finite sequence (Xo, X1,...,Xr—1) a period of (X;,) modulo m 
and r the length of this period. 
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Let + > 1 (r = T(m)) be the minimal possible value for r. Suppose that 
for some w > 0, we have 


Xniw = Xn (mod m), 


for all sufficiently large n > 0. Let w = qr + p, where 0 < p < 7. We have 
Xntgqr+p = Xn (mod m), which means by definition of 7 that we must have 
Xn+p = Xn (mod m) for all sufficiently large n. If p > 0, this contradicts the 
definition of 7; hence, we must have 7 | w, and therefore, any period length 
of (X,) modulo m must be divisible by 7(m). When gcd(m, Q) = 1, we must 
have 


Xn4+r = Xn (mod m), 


for all n > 0. That is, the sequence (X,,) is purely periodic modulo m with 
period (Xo, X1,..., X;—1). Since this is true independent of the initial vector 
To, it must hold for both Tp = (0,1) and Top = (2,P), which shows that 
Lucas’ periodicity conditions hold for any m such that gcd(m, Q) = 1. Also, 
if X = U, we see on putting n = 0 above that the rank of appearance p of m 
must, by Theorem 8, be a divisor of 7. 


4.2 The Chebyshev and Dickson Polynomials 


The Chebyshev polynomials crop up in just about all areas of numerical 
analysis. In particular, they are very important in such subjects as orthogonal 
polynomials, polynomial approximation, numerical integration, and spectral 
methods. A broad, up-to-date treatment of them can be found in Mason and 
Handscomb [19]. Our principal concern in this section will be the connection 
between these objects and the Lucas functions. 


To begin, we recall the addition formulas for sine and cosine mentioned in 
Sect. 2.2. If we put « = n@ and y = @ and then y = —9@, we get 


sin(n+1)0 = sinn@ cos 6+cos nd sin 8, cos(n+1)@ = cos né cos O—sin nd sin 6 
and 

sin(n—1)@ = sin né cos —cosné sin @, cos(n—1)0 = cos né cos 0+sin né sin 0. 
On adding, we get 


sin(n + 1)6 = 2sinné cos @ — sin(n — 1)6, (4.1) 


cos(n + 1)0 = 2cosné cos 6 — cos(n — 1)6. 


We observe that since sin20 = 2sin@cos@, cos20 = 2cos?6 — 1, we can 
easily prove by induction that for any fixed nonnegative n, there exist integer 
constants a;, b; (i =0,1,2,...,n) such that 
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sin(n + 1)0 = sind > a;(cos@)', cosné = S- b;(cos 0)’. 
i=0 i=0 


It follows that for each nonnegative integer n, there exist two polynomials of 
degree n with integer coefficients, T,(x) and U;,(«), such that 


sin(n + 1)0 =sind-U;,(cos@), cosn@ = T,,(cos@). 


We call T,,(a) the Chebyshev polynomial of the first kind and U,(x) the 
Chebyshev polynomial of the second kind. Note that To(a) = Uo(x) = 1, 
T(x) = x, and Uj; (a) = 2x. Also, from (4.1), it is easy to deduce that 


Tn41(@) = 2aT,(@)—-Ty_-1(a) ands Uy 4.4 (a) = 22U,(x)—Up_1(x), (4.2) 


(n = 1,2,3,...). From the initial conditions on T,, and U, and (4.2), we find 
that 
2T,(z) = Va(2a,1), Un(x) = Un41(22, 1). (4.3) 


These are the relationships which connect the Chebyshev polynomials to 
the Lucas functions with P = 2x and Q = 1. Thus, any identity involving the 
Lucas functions can be used to find a corresponding identity which involves 
the Chebyshev polynomials. For example, from (2.8), we get 


Tp (a)? — (2? —1)U,-1(z)? = 1; 


hence, by the results in Sect.3.6, we see that the successive solutions of 
the Pell equation with d = x? — 1, starting with (z,1), are provided by 
(Tr(a), Un—1(@)) with n > 1. Also, since Vinn(P, Q) = Vn(Vm, Q™) by (2.21), 
we see that Tinn(@) = Tn(Tm(x)); this result is an important ingredient in a 
public-key encryption scheme promoted by Kocarev et al. [11]. By the first 
identity in (2.26), we have V,, = Un1+1 — QUn_1; hence, by (4.3), we get 


2T n(x) = Un (x) — Un—2(2). (4.4) 
Notice that the derivatives T/(x) = 1 = Uo(x) and T2(x) = 4a = 2U; (2). 
Suppose that T/ (a) = nU,-1(a) and T/_,(@) = (n — 1)U,-2(x); this is 


certainly true for n = 2. If we take the derivative with respect to x of the 
first formula of (4.2), we get 


Tr4i(&) = 2Tp (x) + 20T;, (2%) — Th_1 (2) 


= U, (x) — Un—o2(x) + 2anUp_1(x) — (n — 1)Up~-2(2) 
= U, (a) + n(2aU;,-1(x) — Un—2(2)), 


Se 


by our supposition and (4.4). By the second formula of (4.2), we find that 
T)41(@) = (n+1)U,(a); thus, we have proved by induction that 


Tia) =H pale). (S123), (4.5) 
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Some years ago, some use was made of the modified Chebyshev polynomi- 
als, C,,(x) and S;,(a); these were defined as 


Cr(a) = 2T (2/2), Sn(#) = Un(a/2). (4.6) 
We note from (4.3) that 
C(t) =Vn(a,1), Sn(*) = Unsi(x, 1); (4.7) 
hence, Co(«) = 2, So(x) = 1, Ci(a) = 2 = Si(x), and 
Cn4i(x) = 2Cpq(2) —Cr—1(2), Sn4i(t)=25n(2)—Spi(a), (4.8) 


for n > 1. Although these degree n polynomials are monic for n > 0 and 
have some nice algebraic properties, they never seemed to catch on with the 
numerical analysis community (see [19, p. 11]). Also, the S and C' notation 
seems to have been largely abandoned, and it is now customary to use (2 in- 
stead of C and V instead of S. Several properties and applications of 2,,(2) 
and V,,(x), together with references, can be found in Witula and Stota [40]. 
We will prove some results concerning them below, but because of possi- 
ble confusion between V,,(a) and V,(a,1), we will revert back to the earlier 
notation. We first note that 


Q(z + 27+) = C,(a +27") = Va(2 +274, 1) =a" +0; 


4.9 
Sn(e +a") = Ungi(e+a7',1) = (at a") /(@—2~"). oe 
By the Girard-Waring formula (2.37), we have for n > 0 
[n/2| oe Sk 
. = ay k n—2k 41 
cna) = So (ha (" 5 "Ye (4.10) 


By the first formula of (4.6) and (4.5), we can easily derive Cl, (a) = nS;,_1(2); 
hence, by differentiating (4.10), we can readily deduce that 


[n/2] 
sa(a)= ot" | har, (4.11) 


Note that by putting « = (a/8)"/? = a™/Q™/?, then (4.9) together with 
(4.10) and (4.11) can be used to derive multiplication formulas for the Lucas 
functions. Additional formulas can be found by putting x = a™/(—Q)'™/?; 
for example, when n is positive and n = 2r, we can use the second formula 
of (4.9), formula (4.11), and sum over j = r — k — 1 to obtain 


r—1 é 
r+j ipym(r-jr- j 
UVarm — Vin >, (, _ j +. . DIQ (v3 a, (4.12) 
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Other, similar formulas can also be derived in this way. See Sect. 4.2 of [39]. 
We also see from (2.21) and (4.7) that 


Crm(®) = Cm(Cr(2)). (4.13) 

The degree n polynomial G,,(a) introduced in Sect.3.3 satisfies the same 

recurrence as S,,(x). Thus, since 1 = Go(x) = So(x) + S_i(x) and x+1= 
Gi(x) = S,(x) + So(x), it is easy to see that 

G(x) = Sp(a) + Sp_-1(2), (4.14) 


for n > —1. By using (4.14) and (4.11), we find that 


Gn(x) = do(-e) (" - ear 
i=0 


Also, by (4.14) and the second formula of (3.8), we get 
G(x) + Gy_1(x) = (@ + 2)S,_-1(2), (4.15) 
and by (4.4) and (4.6), we have 
G(x) — Gy_1(x) = C(x); (4.16) 


from this result, it follows easily that we can write 
Gr(z) =1+ 5° C;(2). (4.17) 
j=l 


Since, by (2.16), we have 
Unim(Z, 1) = Via, 1)Um(z, 1) _ Um—n(2, 1), 
we can use (4.7) to derive 


Sntim—1(2) = Cr(£)Sm—1(2) a ee eee 
Snam(2) = Cr(2)Sm(x) — Sm—n(x). 
On adding these and using (4.14) and (4.16), we get the addition formula for 
G,, (2): 
Gmin(£) = Gn()Gm(£) — Gn—1(2)Gm(x) — Gm_n(x). (4.18) 
Particular instances of this formula are 


Gomta(t) = Gm4i(2)? — G@m4i(2)Gm(2) - 1, 


. (4.19) 
Gom41(@) = Gm41(£)Gm(x) — Gm(a)* +1. 
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Also, since Gyn_1(@) = UGm(x) — Gm4i(x), we have 
Gom(z) = Gmii(2)Gm(x) — (2 — 1)G,,(2)* - 1. (4.20) 


Formulas (4.19) and (4.20) can be used in the manner discussed in Sect. 3.3 
to produce a fast method to compute G;,(a) for large values of n; this, in 
view of (3.7) and (3.8), provides a rapid technique for computing the Lucas 
quotients V,,/V, and U;,/U, (mod N) when r is large and odd. 


We next turn our attention to the Dickson polynomials. These polynomials 
are generally used in the study of finite fields, where they provide many 
examples of permutation polynomials; such polynomials induce permutations 
on the finite field Fy of g elements. Much more concerning this topic can be 
found in Chap. 7 of Lidl and Niederreiter [14] and in Mullen and Panario [21]. 
We will confine our discussion here to the relationship between the Dickson 
polynomials and the Lucas functions. 


For a positive integer n and a fixed a € F,, we define the nth Dickson 


polynomial of the first kind D,,(z,a) over F, by 
[n/2] a ae 
Dy(x,a) = y ate | 


We define the nth Dickson polynomial of the second kind E,,(z,a) over F, 
by 


E,,(a, a) = > Cay (" r : Pes 


Notice that these polynomials are monic and of degree n over F,. From the 
definitions, we see immediately from (2.34) and (2.30) that 


Dy(a,a)=Vn(2,a), E,(x,a) = Un4i(2,a). (4.21) 


Many characteristics of the Dickson polynomials can be found in Lidl et 
al. [15], but in view of (4.21), several of these can be derived from the proper- 
ties of the Lucas functions. For example, we have Do(x, a) = 2, Fo(a,a) = 1, 
and D,(x,a) = a = E,(az,a), and from (4.21) and (2.1), we get the recursive 
formulas: 


Dyai(a,a) = cD, (x, a)—aDy_1 (2,0), Ensi(x,a) = cE, (x, a)—aEp_1(2, a). 


From (4.3), it is easy to see that the Dickson polynomials are related to the 
Chebyshev polynomials by 


Dn(2x,1) =2Tp(z), En(2z,1) = Un(2). 
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From the remarks in Sect.3.7 and earlier in this section, the above relation- 
ships suggest that the Dickson polynomials could be applied to public-key 
cryptography; indeed, this was done as early as 1981 by Miller and Nobauer 
[22]. For a more modern treatment of several of the two-key cryptosystems 
mentioned in this section and in Sect. 3.7, the reader is referred to Brandner 
[3]. 


4.3 Some Additional Arithmetic Properties of the Lucas 
Sequences 


In Chap. 2, we derived a number of arithmetic properties of the Lucas se- 
quences, but there are many more such results. In this section, we will detail 
some of these. We set out by observing that ifn, m > 1 and m is odd, then 
by Theorem 4, we have V,, | Vinn- Furthermore, by (2.36), we find that 


Veal, at—I rg God VY): 


hence, 
gcd(Vmn/ Vins Vn) | MQrM™—Y/2, 


There is a similar result for U. 


Theorem 60 If n,m > 1, we have 
gcd(Umn/Un,Un) | mQrer/., 
Proof If m is odd, we have from (2.38) 
UnifUn= mgr? (mod U,), 


and the result follows easily in this case. We now turn to the case of m even. 
If m = 2r, we get 


Umn/Un = rV,Qrer-) (mod U,,) 


by (4.12). It follows that gcd(Umn/Un,;Un) | gcd(rVr,Qr"-),Un). Since 
gcd(rV,Qr"—-)), U,) must divide rQ’"—) ged(Vn, Un), we see that ged(Umn / 
Un, Un) must divide 2rQ’™” by Lemma 4. The result now follows for even m. 


If U is regular (gcd(P,Q) = 1), then, by Lemma 3, we have 
gcd(Vinn/Vn, Vn) =1 and gcd(Umn/Un, Un) | m. 


These results seem not to have been known to Lucas; the latter was first 
proved for m a prime by Carmichael in [7, p. 51], using a different approach. 
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We next look at Lucas’ [16, Sect. XX VI] generalization of Euler’s y func- 
tion (see also the discussion in Sect. 2.6). For any U and any prime p such 
that p { Q, we define yy(p*) by yu(p*) = p*-!(p — e»), where ep is the 
value of the Legendre symbol (D | p). By our results in Sects. 2.4 and 2.5, we 
have p* | Uy, where f = yy(p*). If n > 1 and ged(n,Q) = 1, we define the 
multiplicative function yy(n) as 


gu (n) = | J vu(p*), 


where the product is taken over all p* such that p* ||n. Notice that if D is a 
perfect square and gcd(n, QD) = 1, we have yy(n) = y(n). 

For any positive integer n coprime to Q, we have n | Uy, (n), and therefore, 
by Theorem 8, we get p(n) | y(n). We observe further that if we define A(n) 
to be the least common multiple of yy (p") for all p* such that p* ||n, then 
we must have n | U,n) and p(n) | A(n). If n is odd and ged(n,QD) = 1, 
then 2 | yu(p*); thus, in this case, A(n) | gu (n)/2°—1, where t is the number 
of distinct prime divisors of n. 


We next establish the following technical lemma. 


Lemma 23 If n is odd, gcd(n,QD) = 1, and n has t > 2 distinct prime 
factors, then yy (n)/2'-! <n—-1. 


Proof Since t > 2, we have 


0 T(1-2) s(iet)(d)erfer 


it follows that yy(n)/2°~! < 4n/5 < n—1, because n has at least 2 distinct 
odd prime factors and must, as a consequence, be at least as large as 15. 


We are now able to prove the theorem whose proof gave Lucas so much 
trouble. 


Theorem 61 Suppose U is regular. If n is odd, gcd(n,Q) = 1, and p(n) = 
n—vy, where |\y| =1, then n is a prime. 


Proof Suppose n is composite and that gcd(n, D) > 1. We must have some 
odd prime g such that g | n and q | D. Since q | U;,—-,, we must have p(q) | n—y 
by Theorem 8. Also, by Theorem 9, we have p(q) = q, which is impossible 
because gcd(n,n — y) = 1. It follows that we must have gcd(n,D) = 1. 
Since p(n) | A(n) and A(n) < yy (n)/2*-! < n—1 by the lemma, we get a 
contradiction. 


The above result can be used to prove Lucas’ fundamental theorem mentioned 
in Sect. 3.2. 


Theorem 62 [f n is odd, U is regular, and n | Un—y, where |y| = 1, but 
nt{ Ug for all aliquot divisors d of n—7y, then n is a prime. 
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Proof Since U is regular, by Lemma 2, we have gcd(U,,, Q) = 1. Also, since 
n{ Ug for all aliquot divisors d of n — y, we must have p(n) = n — +. The 
theorem now follows from Theorem 61. 


We next concern ourselves with how the numbers in the Lucas sequence 
U can be factored. As pointed out in Sect. 3.5, the general problem of integer 
factorization is considered to be very difficult, but, as noticed by Carmichael 
[7], the Lucas functions U,, possess some distinctive properties, which allow 
them to be represented as a product of some special integers. In order to 
find this representation, we first consider the cyclotomic polynomials ®,(2), 
mentioned at the end of Sect. 3.5, in greater detail. We define ®,,(x) to be the 
unique polynomial with integer coefficients which divides x” — 1, but does 
not divide x* — 1 for any positive integer k < n. The roots of ®,(a) must 
therefore be the y(n) primitive nth roots of unity, and we can write 


$,(x) = | [(«-¢*) (4.22) 
where ¢ is some primitive nth root of unity (e?™’/” is such a root) and the 


product is taken over all integral k > 1 such that gcd(k,n) = 1 and k < n. 
It follows that ®,,(x) is monic and its degree is y(n). We also see that since 


S> o(d) =n, (4.23) 
d|n 
we get 
x" -1=|]4,(2), (4.24) 
d\n 


where the product is taken over all the positive divisors d of n. Further- 
more, Gauss showed that the cyclotomic polynomials are irreducible over the 
rationals (see, e.g., Weintraub [37]). 

Let js denote the Mébius function defined by j(1) = 1 and for n > 2 


p(n) = 1, if n is square-free with an even number of prime factors. 


p(n) = —1, if n is square-free with an odd number of prime factors. 
p(n) = 0, if n has a squared prime factor. 


By Mobius inversion applied to functions written as products, we find from 
(4.24) that 


5, (0) = Tf (e? 1" = To = 10 = Ta 2/44, (4.28) 
dln dln d|n 


where, as before, the product is taken over all the positive divisors d of n. 
The last equality in (4.25) follows from }7q),, 4(d) = 0 when n > 1. For more 
information concerning ®,(2) and examples, see Appendix 6 of Riesel [25]; 
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results involving the Mobius function can be found, for example, in Sect. 3.5 
of Shapiro [30]. 

We now define the sequence of numbers (Qn)n>1, called the cyclotomic 
numbers associated with U,,(P,Q), by putting Q; = 1 and for n > 2, 


Qn = Qn(a, 8) = [[(e— ¢*8), (4.26) 


where ¢ is some primitive nth root of unity and the product is taken over all 
integral k > 1 such that gcd(k,n) = 1 and k < n. For example, Q2 = a+ = 
P and Q3 =a? + a6 + 6B? = P? — Q. By (4.22), we have for n > 2 


Qn = Qn(a, B) = 8° By (a/B). (4.27) 


Thus, Qn(a,@) can be regarded as a polynomial in a and ( with integral 
coefficients. If we put « = a/f in (4.22), we see by (4.27) that 


(a/8)” —1 = (a/8 -1)Qi J] Ba, 


where the product is taken over all divisors d > 2 of n. By (4.23), we find 
that 
Un = || Qe. (4.28) 


Notice that ifn > 2 and1<k<n,then1<n—k <n and gced(k,n) = 
gcd(n — k,n). It follows that n | }>k, where the sum is taken over all integral 
k > 1 such that gcd(k,n) = 1 and k < n. Thus, if n > 2, we have 2 | y(n), 
and we can write (4.26) as 


Qn(a, 8) = [[(e- ¢*8) = [J ¢*8 - &) = [][(6- ¢*e) 
=] [(@-¢"*a) = [](6 - ¢*a) = Qni(8, a). 


Since Q,(a@, 3) is an integral polynomial in a, 3 and symmetric with respect 
to a, @, we see that for any integers P and Q, we must have Q,, an integer 
for n > 1. Hence, the formula (4.28) expresses U,, as a product of integers. 

At this point, we should mention that Lucas [17, 18] observed that certain 
formulas for ®,,(a), which he attributed to Aurifeuille and Le Lasseur, could 
be used to derive identities involving the Lucas sequences and for factoring 
some of these expressions. For example, in [18], he gave the identity 


Vion/Van = (Van + 5Q”Van + 7Q?")? — 10Q” (Van + 2Q"Vp)?. 
He derived this from the Aurifeuillian formula 


719 4 4 


= (a* + 5a? + 7a? + Sa + 1)? — 10a(2? + 20? + 20 + 1), 
ee Il 


Poo (x) => 
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by putting z = a"/8", where a+ 8 = P and af = Q. Notice that this 
formula for Vion/V» will factor as a difference of squares when 10Q” is a 
perfect square. This means that n should be odd and Q = 10L?, where L is 
an integer. 

Other results, apparently not known to Lucas, can be found by substitut- 
ing other expressions involving a” /8” for « in Aurifeuillian formulas. As an 
example, consider 


_ a1 


=a = (2? +32 +1)? —5a(24+1), 


a formula known to Lucas. If we put « = (—a”/8")'/?, we get 


Von 
Vn 


= (Vi — 5Q")? + 5Q”"DU;, 


= (DU2 - Q”)? + 5Q"DU?. 


If —5Q”D is a perfect integral square, we have a factorization of Vsn/Vn.- 
Suppose we have integers M, L, and S such that 


M? +41? =5S?, 


and put P = M and Q = —L?. We find that D = 5S? and —5Q"D = 
—25(—L?)", a perfect square when n is odd. If we consider the special case 
of M = L=S =1, we get the pretty identity found by Jarden [10, Sect. 8): 


for an odd n. This result was enlisted by Brillhart, Montgomery, and Sil- 
verman [4] to assist in compiling a table of integer factorizations of Lucas 
numbers. It has been generalized recently by Roettger and Williams [26]. 


For recent information concerning Aurifeuillians and additional references, 
the reader is advised to consult Granville and Pleasants [9] and Wagstaff [34, 
Sect. 4.1]. For some historical commentary, see [39, pp. 126-127, 318-319]. 


Let N be any odd positive integer. In Sect. 3.2, we defined N to be a base 
a pseudoprime if N is composite and satisfies gcd(a, N) = 1 and (3.1). It isa 
remarkable fact that there exist composite N which satisfy (3.1) for all a such 
that ged(a, N) = 1; such numbers are called absolute pseudoprimes or, more 
frequently, Carmichael numbers. It has been known since 1899 (Sect. 3.3.2 of 
Crandall and Pomerance [8]) that a positive integer N is a Carmichael number 
if and only if N is composite and square-free and such that p—1 | N-—1 
for each prime p which divides N. For example, N = 561 = 3-11-17 is the 
smallest Carmichael number. These numbers have been tabulated by Pinch 
[24] up to 1071. 
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We also mentioned in Sect.3.2 the definition of a Lucas pseudoprime; we 
can extend this to a Lucas function analogue of the Carmichael numbers as 
follows: 

For a fixed D, we say that a composite N is an absolute Lucas D- 
pseudoprime if for all P and Q such that ged(P,Q) = gcd(N,QD) = 1 
and P? — 4Q = D, we have 


Un—-e(ny(P, Q) =0 (mod N), 


where e(V) is the value of the Jacobi symbol (D | N). In Williams [38], it 
was shown that if for a fixed D, N is an absolute Lucas D-pseudoprime, it is 
necessary that N be square-free and p—e, | N —e(N) for each prime divisor 
p of N. Since p(p) | p—€p , these conditions are also clearly sufficient for N to 
be an absolute Lucas D-pseudoprime. Thus, we see that an absolute Lucas 
1-pseudoprime must be a Carmichael number, and therefore, the absolute 
Lucas D-pseudoprimes represent a special case of the Carmichael numbers. 
Very recently, Webster and Gurovski [36] have computed all the absolute 
Lucas 5-pseudoprimes up to 10". It is still unknown whether there exist any 
N and D such that (D | N) = e(N) = —1and N is both a Carmichael number 
and an absolute Lucas D-pseudoprime. It is shown in [38] that such values 
of N must be the product of an odd number of distinct primes p satisfying 
p—1|N-—1andp+1|N-+1. A search up to 10!° conducted by Pinch 
[23] did not produce such an N. More recent work of McIntosh and Dipra 
[20] extended this bound to 2 x 10'°, but still found no value for N. Many 
refinements of the ideas presented here, together with several references, can 
also be found in [20, 23] and Shallue and Webster [29]. 


4.4 The Lehmer Sequences 


Suppose R, Q 4 0 and gced(R, Q) = 1. In [12], Lehmer extended the regular 
Lucas sequences by replacing the integer P by VR, where R is an integer. In 
this case, we have D = R— 4Q, which means that D can be 2 or 3 (mod 4), 
something that is impossible for the Lucas sequences. Of course, if R is not 
a perfect square, then we see from (2.1) that U,(WR, Q) and V,,(vR, Q) are 
not always going to be integers for n > 1. If we define 


Un(R,Q) =Un(VR,Q), Va(R,Q)=Vi(VR,Q)/VR (2 tn) 
and 


then it is easy to see from (2.18) that both (U,,(R, Q)) and (V,(R, Q)) satisfy 
the fourth-order recurrence 
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Xn44 = (R- 2Q)Xnt2 — Q°X. 


Since Up(R,Q) = 0, Ui(R,Q) = U2(R,Q) = 1, U3(R,Q) = R-Q, and 
Vo(R,Q) = 2, Vi(R,Q) = 1, VA(R,Q) = R - 2Q, V3(R,Q) = R- 3Q, 
we see by induction that U,, (VR ,Q) and V,,(VR, Q) will always be integers 
whenever n > 0. 

We call the sequences U = (Un(R,Q)) and V = (V,(R,Q)) the Lehmer 
sequences. Since the identity relations given in Chap. 2 hold for non-integral 
values of P and Q, it is possible to derive several properties of these sequences 
which are analogous to corresponding results concerning the Lucas sequences. 
For example, we easily find from (2.23) and (2.24) that 


Van R, Q) (R, oy — 2Q", if 2| n; 
Von(R,Q) = RVn(R,Q)? — 2Q", if 2f n; 


=V,(R 
=V18 


From the identity (2.8), it is easy to derive 
RV? — DU? = 4Q", 


whenever n is odd, and 7 7 
Vi — DRUZ = 4Q”, 


whenever n is even. Indeed, if s denotes an odd integer, we can produce the 
additional identities 


V2 = DU aU = O*, 
DU? — V.-1Ve41 = —RQ*", 


which will be useful in Sect.8.5. These follow from the easily verified Lucas 
sequence identities 


V2 — DUn+iUn-1 = P?Q”* = Va-1Vn4i — DUZ. 


Lehmer was also able to show that U is a strong divisibility sequence and 
if w = w(m) is the rank of appearance of the integer m in U, then w | n 
whenever m | U,(R, Q). Using (2.28) and (2.29), he also showed that if p is 
an odd prime and p{ Q, then 


Up =(D|p)=e& (mod p), Vv, =(R|p) (mod p). 


By using (2.10), (2.12), (2.11), and (2.13), with m = p and n = 1, and the 
above congruences, he deduced that 


p|Up—eo(R,Q) and V,-co(R,Q) = 20QC-©/?_ (mod p), 
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where € = €, := (D | p) and o = op := (R | p). It follows that w(p) | 
p—eo. From these results, we can produce a law of appearance for U;,(R, Q) 
whenever p { RQ, and if p | R, pt Q, we get p | U2,(R,Q). Lehmer also 
proved a result similar to Theorem 2.10.1. 


Theorem 63 Define T = Tt, := (Q|p). If p is an odd prime and p { DQR, 
then p | Vip—ce)/2 if and only if o = —T. 


With these findings, he was able to prove Theorem 50 (the Lucas-Lehmer 
test) by putting R = 2 and Q = —1. (See Theorem 8.4.9 in [39].) 


We can also derive results concerning the V = V(R,Q) sequence which 
are analogous to those proved for the V sequence in Chap. 2; for example, 
gcd(V;,,Q) = 1, ged(Un, Vn) = 1 or 2, and V;, | Vim whenever m is positive 
and odd. Indeed, by using the same kind of reasoning as that employed in 
Sect. 2.7, we can show further that if p is some odd prime such that p{ Q and 
x (= x(p)) (if it exists) is the least positive value of n such that p | V,, then 
w(p) must be even and x = w/2. Also, if pt RQ and p| Vm, then y | m and 
m/x is odd. Thus, if pf DQR and yx(p) exists, then y(p) must be a divisor 
of (p — eo) /2. 

For any odd prime p such that p{Q and any positive integer k, we define 
W(p*) = p*-'(p — (RD|p)). For any n > 1 and gcd(n,Q) = 1, Lehmer 
defined his generalization T(n) of Euler’s totient function as 


T(n) = 2] [ Wor), 


where the product is taken over all p* such that p* ||n. By the laws of ap- 
pearance and repetition for the Lehmer sequences, we have 


Ur(n) =0 (mod n), 


for any positive integer n such that gcd(n, Q) = 1. Lehmer also defined S(n) 
to be the least common multiple of the factors 2 and W(p*) of T(n) and 
noted that n | Us(n); therefore, w(n) | S(m) for any positive integer n such 
that gcd(n, Q) = 1. Thus, by using the methods of Sect. 4.3, we can prove a 
result completely analogous to Theorem 61 for the Lehmer sequence U. 


4.5 The Primitive Divisor Theorem 


Let X = (Xy)n>o0 be any integer sequence. If p is a prime such that p | X,, 
but p{ X for any m < n, we say that p is a primitive divisor of X,. As 
an example, consider the Fibonacci sequence, where we get F,, = 55 when 
n = 10. Since the aliquot divisors of 10 are 1, 2, and 5 and Fy = Fy = 1 
and F; = 5, we see that 11 must be a primitive divisor of Fg. Indeed, if U 
is a Lucas sequence and n > 1, we see that p is a primitive divisor of U, 
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if and only if p(p) = n. Notice that it is not always the case that any U,, 
with n > 1 will have a primitive divisor. For example, consider Fig = 144; 
there are only two primes 2 and 3 which divide this number and p(2) = 3 
and p(3) = 4 for the Fibonacci sequence. For a given U, it is of some interest 
to determine those values of n such that U, contains a primitive divisor. 
For a general Lucas sequence, this turns out to be a very deep and difficult 
problem which has only recently been solved. However, in the case of D > 0, 
the problem was solved by Carmichael [7] in 1913. Yabuta [41] has provided 
an elementary and relatively simple proof of Carmichael’s result, which we 
will sketch below. We will assume in what follows that U is regular. Since we 
are dealing with primes that divide some term U,, with n > 0, we may also 
assume that p{ Q. We first observe that by using (2.1), it is easy to establish 
by induction on n that 


for n > 0. Thus, whatever the sign of D, we may always assume that P > 0. 
By (4.28), it is easy to see that p is a primitive divisor of U,, if and only if 
p is a primitive divisor of Q,,. We now prove some preliminary lemmas. 


Lemma 24 Let p be a prime such that p divides U, withn > 0. If p | Qn, 
then n = p(p)p", where k is a nonnegative integer and p(p) > 0 is the rank 
of appearance of p in U. 


Proof Since p | Un, by Theorem 8, we have n = rp(p)p*, where p { r and 
k > 0. Put m= p(p)p* and let p” || Um. Since p(p) | m, we must have v > 1; 
by Corollary 11 and results in Sect. 2.5, we see that 4p { U,/Um. From (4.28), 
we also see that Qn | Un/Um when r > 1, but since p|Qn, this is impossible. 
It follows that we can only have r = 1 and n = p(p)p*. 


Lemma 25 Let p(p) > 0 be the rank of appearance of p in U. For any positive 
exponent r, we have p|| Qn, where n = p(p)p", except when p = 2, p(2) = 3, 
andr =1. 


Proof For any fixed r > 1, put m = p(p)p"~', and let p(p) = p*t, where p { t 
and s > 0. We have pm = p"**tp(p). From (4.28), we get 


Upm/U, = ]] Gee. 


dt 


where u = pt’. If p 4 2, by Corollary 11, we have p|| Upm/Um; thus, p|| Qju 
for some 7 such that 7 | t. By Lemma 24, we have p(p) | ju, and therefore 
t | jp”, which since p { t means that t | 7 and, since j | t, we get t = 7. It 
follows that if n = ju, then n = tu = p(p)p" and p||Qn. If p = 2, we can 
prove the lemma and the exception by using results derived in Sect. 2.5. 


Let sqf(n), sometimes called the square-free kernel of n, be the product of 
all the distinct primes which divide n. For example, sqf (360) = 2-3-5 = 30. 
We are now able to prove the following lemma. 
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Lemma 26 [fn #1, 2, or 6, a sufficient condition that U,, contains at least 
one primitive divisor is that |Qn| > sqf(n). 


Proof We prove the contrapositive. If U,, contains no primitive divisors, then 
Q,n contains no primitive divisors. Thus, if any prime p divides Q,,, then p 
must divide some Q,, for some minimal m such that 0 < m < n. It follows 
that by Lemma 24, we have m = p(p)p* for some k > 0 and n = p(p)p” for 
some r > 0. Since n > m, we must have r > k+1 > 1. Thus, p | n, and 
by Lemma 25, we see that p? { Q,. Hence, Q, divides sqf(n) and therefore 
lQn| < saf(n). 


We now need to determine when the condition of |Q,,| > sqf(n) holds. We 
will assume that n > 2 is fixed in what follows. We observe that if ¢ is a 
primitive nth root of unity, then 


(a — ¢*8)(a— ¢-*8) = a? + 6? — oB(C* + ¢-*) = P? — QO, 


where 6, = 2+¢" + ¢-* = 24 2cos(2rk/n). Notice that if gcd(n,k) = 1, 
then 0 < 0, < 4. From (4.26) and the remarks following (4.28), we see that 


Qn = [][(P? - 26), (4.30) 


where the product is taken over all integers k such that 0 < k < n/2 and 
gcd(n, kh) = 1. 


Lemma 27 If D > 0, then Q,y, attains its least value when P = 1 andQ = -1 
or when P = 3 and Q = 2. 


Proof Consider any fixed 6, in (4.30). If Q < 0, then since we may assume 
P > 1, we get P? —Q6, > 1+, with equality only when P = 1 and Q=-1. 
If Q > 0, then clearly Q = 1 or Q > 1. In the first case, because D > 0, we 
must have P > 3 and P? — Q6; > 0— 6, > 9 — 20, > 0. In the second case, 
we get 


P? — Q6, > 4Q +1—-— QO, = 9 — 20; + (Q — 2)(4—6,) > 9 — 26, 


with equality holding only when Q = 2 and P = 3. The lemma now follows 
from (4.30). Thus, when D > 0, we need only check that Q,, > sqf(n) when 
U is either the Fibonacci sequence or the Mersenne sequence. 


In order to deal with these two possibilities, we make use of the lemma 
below: 


Lemma 28 If n > 2 anda is any real number such that |a| < 1/2, then 
@,(a) > 1—|al — |al?. 


Proof Since for any real x such that || < 1, we have 1 — x > 1 — |2| and 
(1—2)-! > 1—|a|, it is clear that (1—a”/¢)# > 1—|a|"/¢. By (4.25), we get 
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®,,(a) > [](1 — |a|"), where the product is taken over all positive integers i. 
Since for real x and y such that 0 < 2,y < 1, we have (l—2)(1—y) > 1-2-y, 
we find that 


jal? 
1— |a| 


®,,(a) > (1-la|)(1-|a|’—]a|°—|al*—---) = (1-al)( ) = 1-|a|—|a)?. 


We next deal with the Fibonacci case in the following lemma. 


Lemma 29 [fn 4 1, 2, 6, or 12, then for the Fibonacci sequence, we have 
Qn > sqf(n). 


Proof In this case, we put a = (1+ V5)/2 > 3/2, B = (1 — V5)/2, and 
|8/a| = (3 — V5)/2 < 1/2. By the previous lemma, we get ®,(G/a) > 
i |B/al| |8/a|? = 2/5 — 4> 2/5, and since Q;, = Qn(a, 8) = Qn(8, a) = 
a, (B/a) by (4.27), we find that Q, > (2/5)(3/2)"™. Thus, if 


(2/5)(3/2)°™ > sqf(n), (4.31) 


we are done. 

Let sqf(n) = qigo---q@, where the q (i = 1,2,...,¢) are distinct primes. 
Suppose qj ||n and put r = y(q/) and m = n/q{; we note that y(n) = ry(m). 
We will now show that if 


(2/5)(3/2)" > 2q1, (4.32) 


then (4.31) holds. This is certainly the case form = 1 or m = 2. Thus, 
we may now assume that m > 3; in this case, we have y(m) > 2. We next 
observe that 2/5 > (2/5)?(™ and that 


of3y™.. fo 73\\ 


where u = sqf(n)/q,. If u is odd, then by Euler’s theorem, we have 2°) = 1 
(mod wu); consequently, 2°) > u. If u is even, then u/2 is odd, and 2°™) = 
g°(u/2) = 1 (mod u/2); hence, 2°“)+! > u. Since u = sqf(n)/q, we can 
deduce (4.31) from (4.33). 

Now, if k is any integer greater than 10, it is easy to show by induction 
that (2/5)(3/2)*-1 > 2k; thus, if q, is larger than 7, then (4.32) is true, and 
therefore, (4.31) holds. Also, if gf = 24, 3°, 57, 77, then (4.32) is true. Hence, 
(4.31) holds for higher powers of these primes. It remains, then, to consider 
only the cases of n = 2%3°5°74, where 0 < a < 3,0<b<2,0<c< 1, 
and 0 < d < 1. By substituting these values of n into the left-hand side of 
(4.31), we find that the inequality (4.31) is true except for n = 1, 2, 3, 4, 
5, 6, 7, 8, 9, 10, 12, 14, 15, 18, 30. By direct computation of Q,, for each of 
these exceptional subscripts, we find that Q,, > sqf(n) for n # 1, 2, 3, 5, 6, 
12. Since Ff’; (= 2) and Fs (= 5) each have a primitive divisor, we have our 
result. 
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The following result can be proved for the Mersenne sequence. 


Lemma 30 Jf n 4 1, 2, 6, then the nth term of the Mersenne sequence 
contains at least one primitive divisor. 


Proof In this case, we have a = 2 and 6 = 1 and, by Lemma 28, 
@,(B/a) > 1/4. It follows that Qn > (1/4)2°™ > (2/5)(3/2)"™, with 
the latter inequality holding for n > 2. The remainder of the proof follows 
by using reasoning similar to that employed in the proof of Lemma 29. 


We are finally able to prove Carmichael’s theorem. 


Theorem 64 (Carmichael [7]) [If D > 0 andn £1, 2, 6, then Up, contains 
at least one primitive divisor except for P= 1, Q = —-1, andn= 12. 


Proof By Lemma 27, we know that the least value of @,, can only occur 
when U,, = F, or M,. By Lemmas 29 and 30, we know that in these cases, 
we have Q, > sqf(n) when n # 1, 2, 3, 5, 6, 12. Thus, by Lemma 26, we 
see that U,, contains at least one primitive divisor when n 4 1, 2, 3, 5, 6, 12. 
We compute Qs = 5 for F, and Qs = 31 for M,; hence, by Lemma 27, we 
have Qs > 5 when U, 4 F,. Also, if U, = Fp, then 5 is a primitive divisor 
of Fs = 5. Since D > 0, we have Q < P?/4 and Q3 = P? —Q > 3P?/4>3 
when P>1.If P=1andQ < —2, we get Q3 > 3, andif P= 1 and Q = —2, 
we get Q3 = 3; in both cases, 3 is a primitive divisor of U,. If P = 1 and 
Q = —1, we have the Fibonacci case, and 2 is a primitive divisor of F3. There 
remains the case of n = 12. Since Qj2 = 13 for the Mersenne numbers and 
Qi2 = 6 for the Fibonacci numbers, we have Qi2 > 6 when Uy, #4 Fy. We 
have already seen that F)2 does not contain a primitive divisor. 


In 1955, Ward [35] showed that a similar result holds for the Lehmer 
sequences U. 


Theorem 65 Jf D > 0 and n £ 1, 2, 6, then U, contains at least one 
primitive divisor except for R= 1, Q = —-1, andn = 12 or R=5, Q=1, 
and n = 12. 


Proof This can be shown by using reasoning similar to the above. An impor- 
tant observation is that if nm £1, 2, 6 and the cyclotomic number associated 
with U,(R,Q) for both R = 1 and Q = —1 and R=5 and Q = 1 is greater 
than sqf(n), then all Lehmer sequences U,, contain at least one primitive 
divisor. 


As mentioned earlier, it is very difficult to find all the U, and Uy, which 
have primitive divisors in the case of D < 0. This problem was only com- 
pletely solved relatively recently by Bilu et al. [2], where is it shown that for 
n > 30, any Lucas or Lehmer number must have a primitive divisor. This 
allowed the authors to present all U,;, and U,, which do not have a primitive 
divisor. While [2] is the work of three authors, it must be emphasized that it 
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represents the culmination of many years of previous work by several inves- 
tigators. As this material would take us well beyond the scope of this work, 
the interested reader should consult [2] for further information. This paper 
is very clearly written and contains an extensive bibliography. 


4.6 Moduli for Which U Contains a Full Set of Residues 


As in Sect. 4.1, let (X,,) denote any second-order linear recurrence with char- 
acteristic polynomial x? — Px + Q, where P and Q are integers such that 
gcd(P,Q) = 1. Following Shah [28], we will say that a given positive in- 
teger m is defective with respect to (X,,) if a complete set of residues 
modulo m does not exist in (X,,); otherwise, we say that m is nondefec- 
tive with respect to (X,,). For example, consider the Fibonacci sequence 
(Ff), where F, = U,(1,—-1). If we take this sequence modulo 5, we get 
0,1,1,2,3,0,3,3,1,4,...; thus, 5 is nondefective in (F;,,) because a complete 
set of residues {0,1,2,3,4} modulo 5 occurs in (F;,). Indeed, we have the 
more general result below. 


Theorem 66 Let p be any prime such that p | D, where D = P? — 4Q. If 
U, = U, (P,Q), then p is nondefective with respect to (Un). 


Proof If p = 2, then because Up = 0 and U; = 1, we see that p is nondefective 
with respect to (U,); thus, we may now assume that p is odd. By (2.28), 
we have Up, = m(P/2)™~+ (mod p) for all m > 0. Also, since p | D and 
gcd(P,Q) = 1, we see that p { P. If we let k be any integer such that 
0<k<p-—land put m= p*?—kp+k, then p—1|m-—1, and for this value 
of m, we have 

Um =k (mod p). 


It follows that (U,,) must contain a complete set of residues modulo p. 


On the other hand, if we take (F;,,) modulo 8, we get the repeating sequence 
0,1,1,2,3,5,0,5,5,2,7,1,... Since the residue 6 (mod 8) does not occur in 
this sequence, we see that 8 is defective in (F;,). With regard to the Fibonacci 
sequence (F;,), Burr [5] proved the theorem below. 


Theorem 67 The integer m is nondefective with respect to (F,) if and only 
ifm = 5*, 2.5", 4.5", 37.5", 6-5", 7-5", or 14-5*, wherek > 0 andj > 1. 


The purpose of this section is to investigate the question of when a given 
m can be nondefective in (X,,). The following simple result is essentially due 
to Shah. 


Theorem 68 If m is defective with respect to (X,), then every positive mul- 
tiple of m is also defective with respect to (Xn). 
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Proof Suppose tm is not defective with respect to (X,); then for every r 
where 0 < r < m—1, there exists some X,, such that X, = r (mod tm). 
Since X,, =r (mod m), we see that m is not defective, a contradiction. 


In what follows, we will assume that gcd(Q,m) = 1; this, of course, is 
automatic when |Q| = 1. We have a somewhat surprising result mentioned 
in Somer and Kiizek [33]. 


Theorem 69 If m is nondefective with respect to (X»,), then m is nondefec- 
tive with respect to (U,). 


Proof By results in Sect. 4.1, we know that (X,,) is purely periodic modulo 
m; thus, we may assume with no loss of generality that Xo) = 0 (mod m). 
Put d= gced(X,,m), and note that since d | Xo, we must have d| X,, for all 
n > 0 by the recursion for (X,,). If d > 1, then since X, # 1 (mod m), we 
see that m is defective with respect to (X,,), a contradiction. Thus, d = 1 
and X; =c (mod m), where gced(c,m) = 1. Once again, by the recursion for 
(X,), we find that 
U,=c 1X, (mod), 


for all n > 0, where c~tc = 1 (mod m). Thus, m is also nondefective with 
respect to (U;). 


In view of Theorems 68 and 69, we see that it is important to examine the 
question of whether a prime p can be nondefective in (U,). 


Theorem 70 If p is any odd prime such that the Legendre symbol (D|p) = 1, 
then p is defective with respect to (Un). 


Proof By Lemma 9 and Theorem 9, we have U, = 1 and U,_; = 0 (mod p). 
It follows from the recurrence (2.1) that 


Up-14n =U, (mod p) (n ea 0). 


Thus, the period length 7 of (U,,) must be such that + < p— 1. Since there 
can be at most p— 1 (< p) distinct elements in the periodic part of (Un) 
modulo p, we see that p must be defective with respect to (U;). 


The situation when (D|p) = —1 is more complicated. We know that in 
this case (Lemma 9 and Theorem 9), we have U, = —1, Up41 = 0, and 
therefore Up42 = Q (mod p). We also have the easily verified (by simply 
using the formula for U,, in (2.5)) Lucas sequence identities 

Un+m — Un+1Um —_ QU,Um-1 (n 2 0, m = 1); and 
Q™Un—m = UnUm+i —— Un+1Um (n Z=>m-s> 0). 


Both of these identities are special cases of the more general 


UU nim = UnUm+k =a OU Une: 
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given as (4.2.14) in Williams [39]. Putting n = p and m =r +1 in the first 
of these yields 
Up+itr = QU,. (mod p) (r 2 0) (4.34) 


and putting n = p+1,m =r in the second yields 
Q’Up41-r = —QU, (mod p) (r= 0). (4.35) 
By iterating (4.34), we get 
Un(p+1)t+r = Q*U, (mod p) (k>0, r>0), (4.36) 
and by using (4.36) together with (4.35), we find that 
Ux(p+1)-r = —Q*-"U, (mod p) (k>1, r>0). (4.37) 


From these two results, we see that for r = 1, 


Ve(pt y+ = OF O(k41)(pt1)—r = -Q (mod p) (k 2 0). (4.38) 
Theorem 71 Suppose that p is an odd prime and (D|p) = —1. If Q belongs 
to the exponent p—1 (mod p), then p is nondefective with respect to (Un); if 
Q belongs to the exponent (p—1)/2 (mod p) and p= —1 (mod 4), then p is 
also nondefective with respect to (Un). 


Proof If Q belongs to the exponent p—1 modulo p, then Q is a primitive root 
modulo p. By the first result in (4.38) and Up = 0, we see that (U,,) contains 
a complete set of residues modulo p. If Q belongs to the exponent (p — 1)/2 
(mod p) and p = —1 (mod 4), then no power of Q can be —1 modulo p. 
Hence, by both results in (4.38), we see that (U,,) must contain a complete 
set of residues modulo p. 


Now suppose p is a prime such that p = 2q¢+ 1, where q is an odd prime. 
Such primes are called Sophie Germain primes, and it is widely believed by 
analytic number theorists (but not yet proved) that there exists an infinitude 
of such primes. Indeed, if P(x) denotes the number of Sophie Germain primes 
< a, then it follows by Hypothesis H of Bateman and Horn [1] that P(x) ~ 
2Cx/(logx)*, where C = 0.6601618---, and it has been shown that this 
estimate is quite accurate for x up to 10'°. (See Sect. 3.5 of Caldwell [6}.) 
For such a prime p, we see that if p{ Q and |Q| 4 1, then Q belongs to the 
exponent p— 1 or (p—1)/2 (mod p). Also, by a result like Theorem 5.3.1 of 
[39], we know that for each value of Q, there exist precisely (p — (Q|p))/2 
values of P modulo p such that ((P? — 4Q)|p) = (D|p) = —1. Thus, by 
Theorem 4.4.6, there must exist many pairs (P,Q) modulo p such that any 
Sophie Germain prime p is nondefective with respect to the corresponding 
(U,,). As a consequence of this, it is very likely that if |Q| > 1, then there 
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exist infinitely many primes p such that p is nondefective in some (U,,). This 
strongly suggests that the problem of determining all nondefective primes 
with respect to a given (U,,) becomes extremely difficult to manage when 
|Q| 4 1. It is for this reason that we assume that |Q| = 1 in the sequel. We 
now deal with the case of an odd prime p such that (D|p) = —1 and Q = 1. 


Theorem 72 If p> 5 is a prime such that (D|p) = —1 and Q = 1, then p 
is defective with respect to (Un). 


Proof By (4.34), we have Up4i+n = Un (mod p) (n > 0). Thus, (U,) is 
periodic modulo p with period length p+ 1. Put r = (p+ 1)/2, and observe 
from Theorem 36 that p divides U,. From (2.10) with m =r, we get 
Q2Un+r = UnVr (mod P); 
also, from (2.8), we have V,? = 4 (mod p). Thus, V;. = 27 (mod p) and 
Urtn = Un (mod P); (4.39) 


where 77 is either 1 or —1 and fixed. We also see from (2.10) that 


2Uy = 2Un—rtr = Un-rVr = 2NUn—r (mod p). 


By (2.7), we have 
Uy, =—nU;-—n (mod p). (4.40) 


It follows from (4.39) and (4.40) that the only possible distinct terms in the 
period of (U;,) (mod p) can be 


@, 4, AUG, eg 


where s = |r/2|. Since 2s+1<r+1< p (p > 3), we see that p must be 
defective with respect to (U;). 


In the case of (D|p) = —1 and Q = —1, we have the following result. 


Theorem 73 If p> 11 is a prime such that (D|p) = —1 and Q = —1, then 
p is defective with respect to (U;,). 


This result was partially demonstrated by Somer [31] and completely 
proved by Schinzel [27], using a different technique. Later, Li [13] established 
this result by extending Somer’s methods. 

Now that all the possible primes that are nondefective in (U;,) can be easily 
identified, it is feasible to extend Theorem 67 to include all sequences (U;,) 
with |Q| = 1. This was done in exhaustive detail by Somer and Krizek [32]. 
As the results in [32] are quite lengthy and detailed, we refer the interested 
reader to that paper for further information and references. 
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Chapter 5 
Some Properties of Lucasnomials 


Quand on sait dire, on peut tout faire. (Marcel Proust) 


Lorsque les connaissances sont un amas d’erreurs et de vérités, indistinctement 
mélées ; lorsqu’une longue ignorance et beaucoup de siécles leur ont laissé jeter des 
racines profondes, la séparation en est difficile. L’ancienneté ne prouve rien. Le 
respect, la croyance de plusieurs dGges ne sont que des préjugés. (Sophie Germain) 


Abstract Lucasnomials, or Lucasnomial coefficients, are a generalization of 
the usual binomial coefficients (7), when n and k are integers. Lucasnomials 
()py are associated with every nondegenerate fundamental Lucas sequence 
U. Their definition includes ordinary binomial coefficients which we get when 
U =U (2,1), ie., if U is the sequence of natural numbers. Remarkably, various 
arithmetic properties of binomial coefficients hold in a more general form in 


this context. 


5.1 Definition. Connection with g-Binomial Coefficients 


Definition 5 (Lucasnomials) Suppose U = U(P,Q) is a nondegenerate 
fundamental Lucas sequence. Assume n and & are integers. We define the 
Lucasnomial, or Lucasnomial coefficient Cis as follows. 


UnUn—1- Un—k . : 
| (ae, ite 

& —— en if k = 0; (54) 
k U 


ifk <0. 


? 


As we will see in the next section, Lucasnomials are always integers if 
n > 0. Lucasnomials are called Fibonomials if U is the Fibonacci sequence, 
that is, if U = U(1,—1). A number of properties of general Lucasnomials 
were first hinted at through studies of Fibonomials. 


© Springer Nature Switzerland AG 2023 105 
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We point out here the close connection between qg-binomial coefficients 
or Gaussian coefficients, introduced by Gauss, and Lucasnomials. Formally, 


Gaussian coefficients (t) 4 are defined by replacing any term U;, i > 0, in 


Equation (5.1) by the g-number 


igi lt gtgt- tat = 2 =Ug+1,9). (5.2) 

The q-binomial coefficients are defined as rational functions in the variable 
q, which actually turn out to be polynomials in Z[q]. If we assign and fix an 
integer value for q, with |q| > 2, then, by Definition 5, they become Lucas- 
nomials with respect to the sequence U(q + 1,q). Gaussian coefficients turn 
up in various mathematical areas and have also been the object of study of 
many articles. Their properties also generalize properties of binomial coeffi- 
cients. By letting the variable q tend to 1 in an identity involving Gaussian 
coefficients, we often either recover or discover an identity satisfied by bino- 
mial coefficients. But it is also possible to obtain a Lucasnomial identity from 
an identity for g-binomial coefficients directly, and vice versa. The upshot is 
that there are two main roads to the study of Lucasnomials. One uses Lucas 
sequences and their arithmetic properties, or particular Lucas sequences such 
as the Fibonacci sequence, while the other main source comes from q-calculus. 
To convert a q-identity into a Lucasnomial identity, where U = U(P,Q), 

a and f are the zeros of «? — Px + Q, or vice versa, one may use the trans- 


formation formulas > 


a Op (5.3) 
and n gn a4 
_ a = n—-19° — 
Un = a = pt, (5.4) 


and, therefore, 


n k \ (6 n n 
_ a | ) 7 pron ) . (5.5) 
a k} K} 
Example 74 Consider for n > k > 0 the q-identity 
-—1 -1 
(i), <7) a), 69 
q q — oq 


which has elegant combinatorial proofs, but is easy to check algebraically. 
Using (5.3) and (5.5), we see that 


y(n 7 : n—1 n—1 
gre ( ) = GQ" g-Pk a-ha Ik) ( ) 4 Brine ®) ( ) : 
k) y k Jy ba) 
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Multiplying by 6*(—*) and using Q = af, we obtain the Lucasnomial iden- 


tity 
n n—-1 nk (2-1 
(0 paca (a en 


Occasionally, it may also be worth viewing Lucasnomials as polynomials 
in Z[P, Q], a result we will establish in the next section. 


5.2 Integrality 


Lucas [56, p. 203] stated that the product of any n consecutive terms of a 
sequence U is divisible by its first n terms, with a near-complete justification. 
If, say, binomial coefficients are introduced via their factorial formula, then 
one may prove their integrality using the Pascal identity (5.19) and an induc- 
tion argument. This is, in fact, how Lucas justifies his statement [56, p. 203] 
after giving a simple proof of the generalized Pascal identity (5.20). Many 
proofs of the integrality of Lucasnomials in the more recent literature actu- 
ally follow this path. Ward [79] proved that generalized binomial coefficients 
(see Definition 6) with respect to a strong divisible’ sequence X = (2n)n>0 
are integers. Hence, his result implies the integrality of Lucasnomials when 
gcd(P,Q) = 1. Combinatorial interpretations of Lucasnomials are, among 
other means, yet other proofs of their integrality. We will mention these in 
Sect. 5.4. By viewing Lucasnomials as rational functions in P and Q, we will 
prove them to be polynomials in Z[P,Q] using a remark made in the proof 
of [71, Prop. 2]. That they are integers immediately follows. 

Note that, by (2.31), for a pair of integers P and Q, U;,(P,Q) is the value 
of the polynomial W,,(P,Q) in Z[P, Q] for all n > 0. By (2.32) or (2.34), we 
also see that V,(P,Q) is a polynomial in Z[P, Q]. 


Definition 6 (Generalized binomial coefficients) Suppose X = (%n)n>0 
or (%n)nez is a sequence of complex numbers, polynomials, or functions with 
Lo = 0 and xz, 4 0 (n > 0). We define the generalized binomial coefficient 
with respect to X for all n > 0 or all n € Z, as the case may be, as follows: 


Enln—-1°**ln—-k . 
nt&n-1 n—k+1 ifk> 1; 


n URUR-1 °° L1 aq 
ie di, ifk =0; (5.8) 
= 1G, ifk <0. 


Remark 24 Lucas did not formally define Lucasnomials. An abstract general 
definition, such as Definition 6, appears in a 1915 one-page note of Fontené 
[28]. However, Ward’s 1936 paper [80] might have been the first paper to 


+ That is, a strong divisibility sequence as is more traditional to say. 
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develop properties of generalized binomial coefficients with respect to an ab- 
stract sequence of complex numbers X = (2n)n>0 satisfying xo = 0, v1 = 1, 
and «, # 0 for n > 2. He uses the pretty terminology “binomial coefficient 
to the base X.” 


Lemma 31 Suppose X = (%n)n>0 OF (2n)nez 18 a sequence of nonzero com- 
plex numbers or functions except that x9 = 0. Ifk > 1 and n are integers 
and 

TEn = Sln_pt trp, (5.9) 


where r, s, and t are variables, or complex numbers or functions, then 


(aa) Ho) (5.10) 
(ate i), 


S8In—k + tp 


Proof Indeed, 


| 
8 
= 
fo OS 
x 3 
I | 
a 
Se 
= 


Definition 7 If n and k are integers, n > 0, then the Lucaspolynomial C) é 
is the generalized binomial coefficient with respect to the sequence of poly- 
nomials (W,)n>0 defined in (2.31). 


We need a small lemma to convert some Lucas identities into polynomial 
identities. 


Lemma 382 If two complex-coefficient polynomials in two variables f(P,Q) 
and g(P,Q) are equal for all nonzero integer values of P and Q, then f = g. 


Proof We may fix the variable Q and obtain two polynomials fg and gg in 
P which, being equal for infinitely many values of P, must be identical. The 
coefficients of fg and gg are polynomials in @ which are equal for infinitely 
many values of Q. Hence, these polynomial coefficients are identical on both 
sides. Thus, f = g as polynomials in P and Q. 


For an example, the identity (2.42) 


Um = niiUm=n = QUm—n—1Un,; 
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valid for all Lucas sequences U, holds for all nonzero values of P and Q. Both 
sides of the equation can be viewed as polynomials in P and Q. Therefore, 
by Lemma 32, we see that 


Yn = n+1Ym-—n = QUn—n—-1Yn- (5.11) 


Theorem 75 Suppose n > k > 1 are integers. Then the Lucaspolynomial 
Cs is a polynomial in P and Q with integer coefficients. 


Proof We saw just in (5.11) that 
Wn = Wep1Vn—k — QUn—K-1We. 


By Lemma 31 with r= 1, s = 41, and t = —QW,,_,_1, we obtain the 


identity 
n n—-1 n—-1 
(), 7 Yon( k ), ae é a ae ae 


That (7) y is in Z[P, Q] follows by the verification of the initial values and 
induction on n. 


Corollary 76 If U(P,Q) is a nondegenerate fundamental Lucas sequence 


and n > 0 and k are integers, then (;),, is an integer. 


Proof Note that the nondegeneracy of U makes Lucasnomials well defined. 


If k <0, then (;),, = 0. If k = 0, then ({{),, = 1. So we may assume k > 1. 
If0<n<k, then, since Up = 0, (;),, = 0. Assume n > k > 1. Theorem 75 


implies that eae is an integer. 


We now state a useful corollary to Lemma 31. 


Corollary 77 Suppose X = (2n)nez is a sequence of complex numbers or 
functions with xz, = 0 if and only if n = 0. Assume for all n and k 


TLn = SIn_-p t+ tee, 


where r, s, and ¢ are functions of n and k such that r = s ifn =k =0. Then 
Equation (5.10), ie., 


a dee) g 


holds for all integers n and k. 


Proof This follows from Lemma 31 for all integers n if k > 1. If k < 0, then 
both sides of (5.10) are null. If k = 0, then (5.10) holds if and only if r = s. 
By hypothesis, ra, = st, + trp = sty. Ifn #0, then x, #0 and sor = s. If 
k =0 and n = 0, then r = s by hypothesis. Thus, (5.10) holds in all cases. 
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Corollary 78 Suppose U(P,Q) is a nondegenerate fundamental Lucas se- 
quence and n and k are integers. Then 


n n—-1 n—-1 
().=to(),-ahe( ea 


Proof The corollary follows from the polynomial identity (5.12) for n > 
k > 1. But applying Corollary 77 to the Lucas identity U, = Up4i1Un_-x% - 
QU,,_-~-1U, with X =U, r =1, s = Ugyy, and t = —QU,_,_1 yields the 
full result. 


Unfortunately, identity (5.13) is not a direct generalization of the addition 
property (™ le = CS ») + C= ‘ of binomial coefficients. Such generalizations 
exist. Indeed, identity (5.7) is one of them. But we will present some others 


in the next subsection. 


5.3 Ten Basic Identities 


In [33, p. 174], ten basic essential properties of binomial coefficients (,), 7 
complex, / integral, are listed. We borrow the names of these properties from 
the book [33] and present their extension to Lucasnomials. Since we did not 
attempt to generalize Lucasnomials for r complex or real, we take r to be an 
integer n. We assume U to be a nondegenerate fundamental Lucas sequence. 
Putting P = 2 and Q = 1 yields the corresponding binomial identity since 
for all integers n, U,(2,1) =n. Note that V,,(2,1) = 2 for all n. 


1. Factorial expansion. For 1 < k < n integers, the formula 


n UnUn—-1°°+ U1 
= 5.14 
(*). (oi ge) (5.14) 
is obtained by multiplying numerator and denominator in Definition 5 by 
‘aes ee 
2. Symmetry. For all integers n > 0, we find that 


(),-(ea), a 


If k < 0, then n—k > 0 and U,U;,-1 +++ Un—(n—n)41 = UnUn-1°+* Vea = 0 
and both sides of (5.15) are zero. Both sides are 1 for k = 0. The identity 
holds for k > 1 by (5.14) since n — (n—k) =k. 


3. Absorption. For k nonzero and from Definition 5, we find that 


("), 7 7 rae (5.16) 
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4. Upper negation. The formula 


() acre) 


holds for all integers n and k. 


Proof If k = 0, then both sides of the formula are 1. If k < 0, then both 
sides are 0. Assume k > 1. Since 55, (e —i)=kn- a and, by (2.7), 
Q/U_; = —U;, we see that 


kn BS? k—=n-—1\ (02 DU ni) (QU pa) > (Q"U_n) 
a is One 
k UnUn-1 er ee Un—k+1 


= Ue) 


CRO RG teas 6.18) 


holds for all integers m, n, and k. 


Proof If k < 0, then both sides are 0. Both sides are (‘"),, if k = 0. So we 
assume k > 1. Again, both sides are equal ifn < k, whethern <0,0<n<k, 
or n = k. Thus, now suppose n > k > 1. For all integers m, we may write 


We (tee Unie Uae Ue pees 
oe), U,V Oye Uy 

pon Enea es bere Up Une 

(Ux +++U1)(Un—e-+U1) Un ++ Unk 

UmUm-1 + Um_n41 Un- ++ Un—k41 
U,, ++: Uy U;, +++ U, 


Remark 25 As noted by Gould [34, p. 26], the identity (5.18) holds for any 
generalized binomial coefficients in the sense of Definition 6. 


6. Addition/Induction. The identity 


(;) ~ (" i ') 7 te ae (5.19) 
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sometimes called, particularly in countries influenced by French culture, the 
Pascal identity, has the direct extension 


n Ve f(n-1 Vanek (n—-1 
(Fees ae ( 


which holds for all integers n and k. 


Proof If k < 0, then both sides are 0. If k = 0, then both sides are equal to 
1 since Vo = 2. Thus, we assume k > 1. Applying Lemma 31 with the Lucas 
identity (2.10), written in the shape 2U,, = VprUn—x~ + Vn—~Uz, where r = 2, 
s = Vz, and t = V,_x, we obtain Equation (5.20) after division by 2. 


Remark 26 Another direct extension of Pascal’s identity was given in (5.7) 
since a = 6 = 1 for U(2,1). Actually, identity (5.7) is valid if we interchange 
the roles of the zeros a and 6. Thus, adding (5.7) to its conjugate identity 


({). = pt eo ) terete a" (5.21) 


yields a second proof of (5.20). Note that we got (5.7) from the q-identity 
(5.6). However, we can derive (5.7) from Corollary 77 and the identity U, = 
a*U;_~+B"—-*U;. The same holds true of the q-identity (5.6), since q” —1 = 
q‘(q”~*-1)+q*—1 implies that ng = q*(n—k),+kg. Finally, we observe that, 
by Corollary 77, (5.7) is true for all integers n and k, not just forn > k > 0. 


Remark 271t is interesting to note that if one applies (5.20) to the Lu- 
casnomial eae er where k > 1, multiply the equation that results by 


(—1)kQer- and perform upper negation on all three Lucasnomials, 


then, after some simplification, one obtains 


p{m _ n+1 7 n 
2) =m ("E ela), 


Up to an index shift, the preceding equation follows from applying Lemma 31 
to identity (2.12), written in the form VU, = 2Q*Un_x% + VaUz. In fact, the 
identity (5.22) is as much a direct extension of the usual addition of binomial 
coefficients as the identity (5.20). Thus, we record it below. The equation 


(eC) eKED, 0 


holds for all integers n and k. 


7. Upper summation. The summation on the upper index 


2 U-Gy 
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derives naturally from the Pascal identity. It was noted by Gould [34, eq. 
(13)] that for generic generalized binomial coefficients with respect to some 
X = (%pn)n>0, we have the general formula 


ee Lk k x k+1 x 


Now since we have several Lucasnomial generalizations of the Pascal identity, 
we obtain several corresponding generalizations for the upper summation 
formula. One comes from identity (5.7): 


+1 
(n—i)(K+1) gi-k (;,) = é ) (5.23) 
a , : 
2, be pat) s 


O<i<n 


which holds for all nonnegative integers n and k. 


Proof Fix k > 0 to an arbitrary value, and perform an induction on n > 0. 
Then express (ave as atl (aos + rti-k(n i?) |. using identity (5.7), 


HI HI 
and we readily obtain, using the inductive hypothesis, that (Ga 


S- eee ae) 
k}y 


O0<i<n+1 


ie equals 


Similar inductive proofs yield, for all nonnegative n and k, the generalizations 


; ai] 
aman Acasa (Aa (;) - & ) (5.24) 
> k+1 k o k+1 és 


O<i<n 


and 
n—iyyi-k— a ne frtl 
Serr ye: Visa ( ) = Vest ) (5.25) 
0<i<n k}y k+l/y 


using, respectively, the generalized Pascal identities (5.20) and (5.22). 


8. Parallel summation. Parallel summation 
3 ea) 7 wu 
é 1- J} k 

i<k 


also derives naturally from the Pascal identity. Using one of the three gener- 
alizations (5.7), (5.20), or (5.22) of the addition/induction formula, we obtain 
three respective generalizations of parallel summation, which are 


pamem(t)=<CE a 


i<k 
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: Bod j k+1 

Se) _ en ) . (5.27) 

i<k a /u U 
and 

i-1 k-1-i : k 
A i n+i n+k+1 

HT] vw TT vorwa-a("P) =qIwo("* 2") . 628) 
i<k j=0 j=0 Uo 55 U 


where as usual an empty product is equal to 1. The three identities (5.26), 
(5.27), and (5.28) hold for all integers n and k. 


Proof We provide a proof of, say, (5.27). Similar proofs for (5.26) and (5.28) 
are possible. We perform an induction on k. If k < 0, then both sides of (5.27) 
are zero for all n. Thus, the induction is well based. Fix @ > —1, and assume 
(5.27) holds for & = @ and all n. We verify that it holds for k = +1 using 
(5.20): 


— Vert ee | Vat aw 

: ert Je 2 e - 
_ Ver (" aia ‘) 4 Vntt > g-t4H1-Dy, ye (" 4 ‘ 
7 tent , 

; le y i<l a U 
_ Ver (nt+e+1\ ‘ 9 ((e+1)+1-A yy (e+) (7 4¢ 
= 5 +1 A ave 

U ixe r 
= ‘> (4D) yyy Ge | 
i<0+1 " 


which terminates the inductive proof. 


9. Binomial theorem. The binomial theorem given in the form 
a+ey"=>0 MY ok (n > 0) 
a k y] Ears 


generalizes into the following Lucasnomial theorem: 


[lt tai") = Soa) @ xk, (n>0) (5.29) 
i—0 k=0 U 


where U(P, Q) is a nondegenerate fundamental Lucas sequence and a and 3 
are the zeros of «2 — Pr + Q. 
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Proof We proceed by induction on n. Identity (5.29) holds for n = 0 and 
n = 1. Assume it holds for some n > 0. If we multiply (5.29) through by 6”, 
distributing one § for each factor on the LHS, and put y = zx, we obtain 
writing x instead of y 


n-1 n 
II (x +a0'8"-*) = S- ar-kQ("2") (;) a, 
i=0 k=0 U 


Multiplying both sides of the above equation by (a + a”), the LHS is the 
desired product [T}_)(a+0'8”"~*), while on the RHS, the coefficient cy of x” 
is 


Ck = argrgts) + peg) é ‘ 1) 
U U 


= ak .gr-kal"2") & Re ae é ). 


—1 
-aH(o(9) v6"), 


ra ae (" : ‘) 


where the last line comes from (5.7). This completes the induction. 


A formula equivalent to (5.29) but for g-binomial coefficients is proved in 
[42, Thm. 348] with techniques that date back to Euler. In fact, Ward [80 
mentions the 1748 formula of Euler 


1, n n T(r— TF .- 
fi Gietayacted w= (2) a D/2gn—ryr. 
qd 


r 
r=0 


Jarden and Motzkin [44, 45] proved that the product of n—1 (n > 1) second- 
order recurring sequences, all with characteristic polynomial x? — Pz + Q, is 
an nth-order linear recurring sequence annihilated by the polynomial 


n 


f(x) = S0(-1)* @ ee ae 


k=0 


If a and # are the zeros of 2? — Px + Q, then a product of the form 
"1 (e1,n0° + c2,48") is clearly annihilated by the polynomial 


n-1 


II (a _ ap), 


i=0 


The above polynomial and f(x) have the same degree, and, in fact, we have 
the identity 
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n n-1 

p [ 1 _ n—k ten-1—-i 
yene(7) Qh DAgn* = [] @— aight), 
k=0 U i=0 


which is equivalent to (5.29). (Compute (—1)"f(—a) and change n — k into 
k to find (5.29).) Also, the formula of Jarden, namely, 


NG — a Br") = ene) & oe 


appears in Carlitz’s work [22], but for the Fibonacci sequence, F' = U(1,—1). 
That is, a and 6 are the zeros of «7 — x — 1. 


10. Vandermonde convolution. The Lucasnomial identity 


Mrn\y m(k—i) gin j7—i(k—i) ( ™ n 
0 eee 9 (er 


where a and £ are the zeros of 77 — Pr + Q and U = U(P,Q), generalizes 
the classical Vandermonde convolution 


m+n m n 
Cr) 
Identity (5.30) holds for all integers m, n, and k. We begin with proving 
(5.30) for m and n nonnegative integers and any integer k. 


Proof We carry out an induction on m+n > 0. If m+n = 0, thenm =n = 0. 
Both sides are 0 unless k = 0, in which case both sides are 1. Let us assume 
(5.30) holds for some m+n > 0 and all k. Then 


(anaes ra eens 
= +a 
k a k Jy k-1)y 
— pk m(k—1) gin W—i(k—i) [ ™ nm 
a ramespigreo(T) (0), 


m+n+1—k m(k—1—i) gin 7—i(k—1—%) [ ™ nm 
Ree eT a) 


= So amh—a) git) Q-ilk-9) (") x 
F U 


a 


a ) oa uae | ; ) 

( k— a U k-1-1 U 

=. > aha) gilt lh) Q—th—2) (") é + 4 
i t)y\k-isy 


where the first and last equalities come from Equation (5.21). 
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To prove (5.30) holds for all m,n, and k, it is convenient to use the 
equivalent g-Vandermonde convolution identity 


CBee me) (2), 


which can be seen to be equivalent to (5.30) using Equations (5.3) and (5.5). 
Note that for g = a/8 = 1, we fall back on the usual Vandermonde convo- 
lution. If k < 0, then both sides of (5.30) are zero. Thus, we assume k > 0. 
Say we fix q > 1 real, n > 0 and k > O two integers, then we can view 
(5.31) as an identity between two real polynomials in X = q™ each of de- 
gree k. (For each 0 < i < k, (2), is a degree-i polynomial in X, while 
gk) — g-k-Y) Xk-7 has degree k—i.) As (5.31) holds for more than k 
values of X, i.e., for all g’, m > 0, the two sides as polynomials in X must be 
equal. The coefficients of X*, 0 < @ < k, are polynomials in Y := q” of degree 
at most k on both sides of (5.31). As they agree for all values of Y = q”, 
n > 0, more than k values, these polynomials must be identical. Therefore, 
(5.31) holds for all X and Y, in particular, all X = g™ and Y = q”, m and 
n in Z. 


5.4 Combinatorial Interpretations 


Ifn > k > 0 are two integers, then the binomial coefficient a) is the number 


of subsets of size k of a set with n elements. This well-known and useful 
combinatorial interpretation of binomial coefficients is often taken as a def- 
inition. In an xy-coordinate system, () is also the number of paths from 
(0,0) to (k,n—k), or to (n—k,k), where a path is a continuous succession of 
eastward or northward one-unit steps. Reasons for seeking a simple combina- 
torial interpretation for Lucasnomials are of at least two kinds. On the one 
hand, there is an obvious mental satisfaction associated with such a finding. 
On the other, they are potentially useful: Some identities—identities have 
two sides—are given new proofs based on counting objects in two different 
ways, one for each side of the identity. This approach may also suggest new 
identities. We note that this combinatorial manner of proving identities was 
the object of a whole book [16]. 


Existing interpretations of Lucasnomials count weighted tiled paths. But 
we distinguish two main families of interpretations. Thus, we divide Sect. 5.4 
into two subsections. 
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5.4.1 Square-and-Domino- Tiling Interpretations 


Given a rectangular 1 x n board, it is fairly well known that the number of 
ways to tile this board using square tiles which occupy one cell and dominos 
which cover two consecutive cells of the board is F,41, where F;, is the n-th 
Fibonacci number. A variant of this is: In how many ways can one reach the 
top of a staircase of n steps if you can go up either one or two steps at once? 
(In fact, as mentioned earlier, the Fibonacci numbers were known in ancient 
India well before the days of Leonardo de Pisa: Both in some Sanskrit poetry 
and Tabla music, the rule was to use short and long syllables or short and long 
beats twice as long as the short ones. The number of combinations of a given 
length, thus, was a “Fibonacci” number. In Fibonacci’s population dynamics 
of a group of rabbits, you also have the mature rabbits that reproduce every 
period and the young ones that become adults after one period and, thus, 
require two periods to reproduce.) But let us return to our tilings with an 
example: If n = 3, then there are Fy, = 3 tilings, namely, sss, ds, and sd, 
where s and d stand for a square and a domino, respectively. For each “s- 
and-d” tiling T of an 1 x n board, we define the weight w(T) of the tiling T 
as the product of the weights of each of its tiles, ie., w(T) = P*(—Q)¢. The 
weight |T| of a set of tilings T is defined as the sum )),-7 w(T). Thus, if Tn 
is the set of tilings of a rectangular 1 x n board using squares and dominos, 
then we find that 

On+1 = [TnI (5.32) 


where U = U(P,Q) and, by convention, |7o| = 1. This follows by induc- 
tion since tilings in J; ending with a square contribute P|7,_1| to |Tn| 
and tilings ending with a domino contribute (—Q)|Tn—2|. Hence, |7,| = 
P\Tn—1| — Q|Tn—2|. Clearly, |7i| = P = U2 and |72| = P? — Q = U3. Note 
that if P > 1 and —Q > 1, then tiling weights bear a concrete interpretation: 
We may think of squares coming in P different colors and dominos in yet 
—Q distinct other colors. The Fibonacci case is singular as P = —Q = 1 so 
that the weight |7| of a set of tilings 7 is simply the cardinality of T. Note 
that our use of squares and dominos as the tiling modus operandi is linked to 
our observation (2.30) that U,(P,Q) = %,(P, Q), where W,, is a homogeneous 
polynomial of degree n — 1, if the variable P has degree 1 (the length of a 
square tile) and Q has degree 2 (the length of a domino). 


Example 79 The number of tilings with squares and dominos of a 1 x (m—1) 
rectangular board is F,,. Consider the number of such tilings that use a 
domino d to cover the two consecutive cells n and n+ 1. There are F,, ways 
of tiling the initial cells 1,...,n—1, and F,,-n_-1 ways of tiling the m—n—-2 
final cells n+2,...,m—1. Hence, there are F;, Fy, -n-1 ways of simultaneously 
tiling the cells left of d and the cells right of d. The remaining tilings do not 
have a domino straddling over cells n and n+1. So we can tile the first n cells 
(in Fy41 ways) and the last m—n— 1 cells (in Fy,» ways) independently. 
Therefore, Fi, = Pr Fm—n—-1t+Fn4ifm—n- With P-weighted squares and —Q- 
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weighted dominos, the same argument carries over, but the domino d has 
weight —Q, so this reasoning yields a combinatorial proof of the important 
identity (2.42), i.e, Um = Un41Um—n — QUnUm-—n-1- 


Thus, the numerator U,Uy_1-+-Un—r41 of ( can be interpreted as the 


number of simultaneous square and domino (weighted) tilings of a collection 
of boards of respective length n—1,n—2,...,n—k. Using this interpretation, 
Benjamin and Plott [17] were able to describe a procedure by which one could 
view the denominator U; ---U, of es) y as subtilings of the numerator. Their 
procedure provided a first combinatorial explanation of the integrality of 
Lucasnomials. 


Interestingly, the associated Lucas sequence number V,, = V,,(P,Q),n > 1, 
counts the number of circular weighted tilings of a 1 x n rectangular board. 
The tilings are made of squares and dominos with the same respective weights 
of P and —Q, but it is possible to have a domino straddling over the n-th and 
the 1-st cells. These are sometimes called bracelet tilings. Thus, ifn = 2, then 
we can tile in three ways, two squares, or in two ways, one domino, giving 
V2 = P? — 2Q. 


Example 80 We may prove again the first identity in (2.26), ie., Vn = 
Um+1 — QUm-_1, by observing that there are two kinds of tilings of a length- 
m bracelet: the —-QU_1 tilings with one domino covering both cells m and 
1 and the U,,4, tilings with no such domino. 


mn) were 


In 2010, two combinatorial interpretations of Lucasnomials ( a 
found by Sagan and Savage [71]. The second uses circular tilings. We only 
describe the first. Any lattice path from (0,0) to (m,n), where m and n are 
positive integers, i.e., a path consisting of m+n unit steps, m going eastward 
and n northward, leaves n rows (m,mg,...,7Mn) on its left, bounded by 
the y-axis, and m columns (n1,72,...,%m) below the path and bounded 
southward by the x-axis. By tiling each row m;, 1 <7 <n, and each column 
nj, 1 < j < m, with squares and dominos with the caveat that the tilings 
of the columns—to be legal—have to begin with a domino, we associate a 
weight to the given path. Here, the weight of one particular tiling of the m 
rows and n columns is the product of the individual weights of each row and 
column. The path weight is the sum of the weights of all legal tilings of the 
set of rows and columns associated with the path. Then ony is the sum 
of the path weights over all paths from (0,0) to (m,n). If a column has 0 
height, i.e., ifn; = 0, then this column is tiled with the empty tiling, which 
has a weight of 1. Although it does not begin with a domino, it is still legal. 
If the j-th column has only one cell, i.e., ifn; = 1, then no legal tiling exists, 
and the path weight is zero. Such a path does not contribute to eae If 
U = F, then all legal tilings have weight 1. Thus, epee p counts the number 
of ways of drawing a path from (0,0) to (m,n) and legally tiling the rows 
and columns. Hence, for a simple example, we choose U = F’, m = 3, and 
n = 2. Of the ten paths from (0,0) to (3,2), only four contribute, namely, 
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pi = NNEEE, pp = ENNEE, p3 = EENNE, and py = EEENN. The 
two northward steps have to cling together for the path to have a legal tiling. 
The paths p; and pz have a weight of 1 because there is only one way to tile 
each of its rows and columns. Path p3 has weight 2 x 2x 1x 1x 1= 4 for its 
two rows each have two tilings, while p, has weight 3 x 3 = 9 from its two 
rows of length three. Thus, the sum of the weights of all the legal paths is 


(de Sia 6-3) SP e/e \= & 
F 


Remark 28 If instead of the Fibonacci sequence F’' = U(1,—1), we chose I = 
U(2, 1), the identity sequence, then squares have weight 2 and dominos weight 
—1. Thus, using the above example, 


=-1+4-9+16=10, 


which is the binomial coefficient (3) as expected. However, this is certainly 
a roundabout way of counting the number of paths from (0,0) to (3,2). 
This remark points out a drawback of the interpretation in which, ideally, 
we would like to count paths in a natural way when dealing with ordinary 
binomial coefficients. 


A few years later [18], these two interpretations of Lucasnomials were re- 
stated in simple terms and illustrated with examples, mostly in the Fibonacci 
case. A couple of identities were combinatorially explained, and a few simple 
and appealing Fibonomial identities, taken from the existing literature, were 
listed as awaiting interpretation. 


Writing k!p for Fi, F,-1--- Fi, (k > 1), the combinatorial interpretation 
given by Sagan and Savage implies that 


(m+n)lp=mlp-nle- Sob, (5.33) 
P 


where p is a path from (0,0) to (m,n) and @, is the number of legal tilings 
of the n rows and m columns associated with p. Viewing k!r as the num- 
ber of ways of simultaneously tiling (with squares and dominos) k rows of 
respective lengths k — 1,k — 2,...,0, Killpatrick and Weaver [46] produced 
an explicit combinatorial bijection, as we may say, between the two sides of 
Equation (5.33). 


Recently [19], another combinatorial interpretation was discovered. It is 
similar, and in some sense isomorphic, to the Sagan and Savage interpretation 
we just described (see [19, end of Sect. 2]), yet more natural and apparently 
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more fruitful. In particular, it led to a first interpretation of Lucasnomial 
Catalan numbers, which we describe in Sect. 5.5.3. 

Let s, be the upward staircase path which starts at the point (n,0) and 
ends at (0,7) alternating west and northward unit steps: 


(n,0) + (n- 1,0) > (n—1,1) > (n- 2,1) 3 --- > (0,n—-1) > (0,n). 


The staircase-shape region S,, to the right of the y-axis, below the path s,, 
and above the z-axis, consists of n— 1 horizontal rows of square boxes. Going 
upward, these rows are of respective lengths n — 1,n — 2,...,1. A tiling of 
S,, is a tiling of each of these n — 1 rows with squares and dominos. If, as 
before, a square tile has weight P and a domino has weight —Q, then there 
are U,U,_1---U; distinct tilings of S,. Here, U = U(P,Q). 

Let k, 0< k <n, be a fixed integer throughout the interpretation, since 
this interpretation is specific to the Lucasnomial (7) ,,. We begin by describing 
how, to any given tiling T of S,,, we associate a unique path p through S,,. 
The path p begins at the point (k,0) and ends at (0,7) and is constructed 
uniquely as follows: 

The path is made of a succession of unit steps going either north or west. 
The path p moves northward unless we step out of the region S,,, or unless 
a northward edge would cross over through the center of a domino in T, 
and, in these two cases, the path moves west. Note that a westward step is 
necessarily followed by a northward step, because of the staircase shape of 
S,, and because dominos have length 2. 

Two tilings JT; and T> of S, are equivalent if their associated paths p 
coincide. The number of tilings of an equivalence class will be shown to be 
a multiple of Dy = (Ux ana U1)(Un—k le U;). Therefore, UU n_-1 aed U;,, the 
number of tilings of S,,, i-e., of all rows in S,,, must be a multiple of D;, and 
ee be an integer. 

But let us return to the path p associated with T. A northward step 
immediately following a westward step is called an NL step, because the 
two successive steps draw an L. Other northward steps are called NI steps. 
Construct a partial tiling B = Br, a subtiling of T, by, in a given row, erasing 
the tiles that lie left of an NL step and erasing the tiles that lie to the right 
of NI steps. Note that two tilings of S, which produce the same partial tiling 
B must lie in the same equivalence class. The number of ways of completing 
the partial tiling B into a full tiling of S,, is D,. Indeed, p must have k NL 
steps, and the i-th NL step has k — i blank cells on its left, for 1 <<i< k. 
Thus, k — 7 varies from k — 1 to 0, and these blank partial rows can be tiled 
in U;,---U, ways. There must be n — k NI steps for the path p to reach the 
final point (0,7) at altitude n. The i-th NI step, 1 < i <n—k&, must have 
n—k-—z7 blank cells on its right, because, due to the staircase shape of S,,, the 
number of cells right of an NI step is not affected by the number of NL steps 
that precede it. Thus, the blank partial rows right of NI steps can be tiled 
in Un_,~-+:U, ways. Therefore, w(T) = Dy, + w(B). An example of a tiling T 
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in Sg is illustrated in Fig. 5.1, where an NI step is designated by an N anda 
west step followed by a north step by WN. 


n=6,k=4 


tiling T: sdd, ssd, sd, ss, s 


partial tiling By: shaded area 


path p: WN, WN, WN, N, N, WN 


0 1 2 3 A 5 6 


Fig. 5.1 Example in Sg 


We will say that a path p in S, from (k,0) to (0,n) is tilable if it is 
associated with a tiling T of S,, in the way previously described. Note that 
not all (‘*) lattice paths from (k,0) to (0, ) made of westward and northward 
steps with no two consecutive westward steps are tilable. For instance, only 
three of the six such paths through Sy, starting at (2,0) are tilable. A partial 
tiling B is said to belong to p, and we write B € p, if B = Br for some tiling 
T of S, with which p is associated. Two equivalent tilings do not usually 
produce the same partial tiling, because the non-erased parts of these tilings 
do not usually agree. If B is the partial tiling derived from T, we showed 
above that w(T) = Dy - w(B). Therefore, 


Un-+-U1 =|Sn| = >_ |Cl(p)| = SS > w(B) - Dy, (5.34) 


p Bep 


where the sums are over all tilable paths p from (k,0) to (0,n) and Cl(p) is 
the class of all tilings to which the same path p is associated with. Thus, the 
Lucasnomial an is interpreted as a sum of weights of partial tilings. That 


7 (i), =S~ >> w(B), (5.35) 


p Bep 


where the outer sum is over all tilable paths p from (k,0) to (0,7). 
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5.4.2 An Interpretation Coming from q-Binomial 
Coefficients 


We begin with a well-known interpretation of q-binomial coefficients and 
mention two of its isomorphic variants before seeing how this interpretation 
carries over to Lucasnomials. 


Indeed, the Gaussian coefficient (2) ‘ is well known to count weighted paths 
from (0,0) to (k,n—k). These paths are non-decreasing in that they use only 
northward or eastward unit steps. To be precise, the identity 


k(n—k) 


().- by Cag’ (5.36) 


holds, where cq is the number of paths from (0,0) to (k,n — k) such that 
the area under the path and above the z-axis is a. Thus, the weight of a 
path of area a is q*. When q = 1, all paths have weight 1 and Ch) 4 a7), 
which simply counts paths. As observed in Remark 28, interpretations of 
Lucasnomials given in [19, 71] have the drawback of not falling back on this 
natural interpretation of ordinary binomial coefficients. 


Konvalina [51, eq. (19)] observed that the g-binomial coefficient Ch) q sat- 
isfies the identity 


@ = S- git tiate tie (5.37) 
q 


OS%1 Sig S++ SipSn—-k 


which he interpreted by using boxes and balls. Suppose there are n —k +1 
boxes, where box i, 0 <i < n—k, contains gq’ balls. Then (ha is the number 
of ways of selecting k boxes and one ball from each box. The same box may 
be chosen several times, the ball being returned to its box, if formerly chosen. 
Note that this interpretation is in one-to-one correspondence with the above 
area-weighted path interpretation. Say we walk along a given path from (0,0) 
to (k,n—k). We will encounter exactly k horizontal edges of successive heights 
O<hy < ho <-++< hy <n—k. The vector (hi, ho,...,hx) actually defines 
the path and corresponds to choosing the boxes numbered hy, hg,..., hx. The 
number of ways of choosing the k balls is qui i which is q*, where a is the 
area subtended by our path. 


Recently, a variant of the above interpretation of g-binomial coefficients 
was exploited to explain combinatorially various q-identities [3]. The authors 
chose to work with a tiling interpretation “isomorphic” to the weighted path 
counting (5.36). That is, there is a one-to-one weight-preserving correspon- 
dence between paths and tilings. Tile a 1 x n board with n square tiles, k 
blue and n — k of color red. Walking along the path from (0,0) to (k,n —&), 
place a red tile for a north step and a blue one for an eastward step. The 
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weight of a red tile is 1, and the weight of a blue one is q’, if that blue tile is 
preceded by 7 red tiles. The weight of the board tiling is the product of the 
weights of individual tiles. This bijection clearly preserves weight as the area 
below an eastward step matches the number of northward steps that precede 
it. Thus, (hg is the number of tilings of a 1 x n board with exactly k blue 


tiles. 
n n— ‘) = (; a :) 
= +a" (5.38) 
& : ( k ), ie 


The identity 
for instance, is easily seen to hold using this interpretation. Indeed, tilings 


that end with a blue tile contribute q”~* (fo) 9 while tilings that end with a 
n—-1 


red tile contribute ( : der The symmetry identity (hg = ce a is explained 


within the frame of this tiling interpretation in [3, p. 101]. Identity (5.6) may 
then be obtained by reasoning as we did for identity (5.38). Consider the set 
of tilings of a 1 x n board with n — k blue tiles. The contribution of tilings 


that end with a blue tile is q* ae =e ("a dat while the contribution of 


those ending with a red tile is na = (a) 


Finally, we observe that the area-weighted path, and correspondingly the 
box-and-ball choosing or the weighted tiling interpretations of g-binomial 
coefficients, generalizes to Lucasnomials. Suppose Lucasnomials satisfy a 
Pascal-like relationship 


Gn =n(") ') tncr(Q a (5.39) 


for all integers n and k, for some sequences (x,,) and (y,,). (Note that taking 
n = 1 and k = 0 implies x9 = 1, while n = k = 1 yields yo = 1.) Say a 
horizontal (eastward) edge of height h has weight y, and a vertical (north- 
ward) edge at distance d from the y-axis, i.e., of abscissa d, has weight xq. 
Define the weight of a path from (0,0) to (k,n — k) as the product of the 
weights of its edges. Then (he is the sum of the weights of all paths from 
(0,0) to (k, n—k). This can be seen by induction on n using (5.39). Note that 
Ge = 1 and the only path from (0,0) to (0,1) is made of a vertical edge of 
abscissa 0 and, thus, of weight 29 = 1. Similarly, Gis = 1, and the one-edge 
path from (0,0) to (1,0) has weight yo = 1. Paths from (0,0) to (k,n — k) 
that reach (k,n — k) through (& — 1,n — k) followed by a horizontal edge of 
height n—k account for a cumulative weight equal to yn_z ea) oc The other 
paths must go through (k,n —k—1) and end with a vertical edge of abscissa 
k. Hence, their cumulative weight is x, ear The conclusion follows from 
(5.39). 

The corresponding “isomorphic” tiling interpretation of (7), uses a 1 x n 
board with k blue and n—k red square tiles. The weight of a blue tile, which 


is preceded by m red tiles, is ym, and the weight of a red tile, preceded by 
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m blue tiles, is x,,. The weight of a tiling is the product of all the individual 
weights of its tiles. Then [leg is the sum of the weights of all tilings that use 
k blue tiles and n — k red ones (or vice versa). 

Note that the Pascal-like identities (5.6), (5.7), (5.13), (5.20), and (5.21) 
are all special cases of the general type (5.39). 

Let us put to use this tiling interpretation mimicking an argument made in 
[3] in the context of Gaussian coefficients. The argument works if we require 


the sequences (z,,) and (y,) to be geometric. 


Theorem 81 Suppose U is a fundamental Lucas sequence and (2m) and (Ym) 
are geometric sequences for which (5.39) holds. Then 


2 


ei ), 7 Sat unt (Z), (5.40) 


U 


Proof A tiling of a 1 x 2n board with n red and n blue tiles can always be 
viewed as a concatenation, (1 x n) x (1 x n), of two consecutive half-boards. 
If there are k blue tiles in the first half-board, then there are n — k in the 
second half-board. Each of the & red tiles in the second half-board sees its 
local weight (i.e., its weight within this second half-board) multiplied by x, 
due to the presence of the extra k blue tiles in the first half-board. Each of the 
n—k blue tiles in that second half has an extra n—& red tiles to its left lying 
in the first half-board. It sees its weight multiplied by y,_,. Globally, we have 


to multiply the product of the weights of the two half-boards (i x oes 


by the factor a . yk, Summing over all possible k yields the result. 


The identities (5.7) and (5.21) involve geometric sequences. Thus, using, 
say, (5.7), we obtain 


Qn = = Pee? a) BAI 
(7), ae i k} iy aa 


It is easy to verify that if (5.39) holds with (am) and (ym) geometric, i.e., 
with x, = a” and y, = b” for all m > 0, then a+b = P and ab = Q. 
Hence, (5.39) is necessarily one of the two identities (5.7) or (5.21). 


5.5 Lucasnomial Catalan Numbers 


5.5.1 Introduction 


The Catalan numbers C;,,, although they pop up in various areas of mathe- 
matics and have very many combinatorial interpretations, are often defined 
via the simple algebraic expression 
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1 2n 
Cn = (7). (5.42) 


It is not necessarily immediate from (5.42) that they are integers. How- 
ever, C;, is the difference (7”) — (,2",). We have Cp = C; = 1, C2 = 2, 
C3 = 5, Cy = 14, Cs = 42, Cg = 132, and C7 = 429. The ratio Cr41/Ch 
tends to 4 as n tends to infinity. They satisfy the convolution identity 
Ch41 = ae C;C;,_;. Their generating function C satisfies eC? -C+1=0 
and is bv A rich book [76] was written on their various combinato- 
rial interpretations, properties, and generalizations. Here, we single out two 
specific generalizations, the Fuss-Catalan and the generalized Fuss-Catalan 
numbers. Given a > 2 and r > 1 integers, the generalized Fuss-Catalan 
numbers C,,-(n) are defined as 


Chi __ (“" +r— ‘) ee: é + e (5.43) 


(a-—1)n+r n an+r n 


for all n > 0, and are integers. In fact, they possess at least one combina- 
torial interpretation, counting Raney sequences [33, pp. 359-363]. The Fuss- 
Catalan numbers correspond to the case a > 2 and r = 1 and have many 
interpretations that extend some interpretations of the Catalan numbers. 

We next illustrate how a few classical interpretations of the Catalan num- 
bers extend to C3,1(m). The Catalan number C,, is: 


1. The number of triangulations of a convex (n+2)-gon into n triangles (using 
n — 1 non-intersecting diagonals) 

2. The number of meaningful parenthesizations in the product of n + 1 ele- 
ments @, * @2 *+++ * @n41, Where * is a binary operation 

3. The number of paths with (1,0) or (0,1) moves from (0,0) to (n,n) which 
start with a (1,0) move and do not cross over the line y = x 

4. The number of ballot sequences of length 2n, i.e., finite sequences (a;)1<i<2n 


with terms a; = +1, nonnegative partial sums ye a;, and zero complete 
sum peas a; = 0 
5. The number of non-crossing partitions of [n] = {1,2,...,n}, ie., such that 


ifa<b<c<d,aandc belong to the same block B, b and d to B’, then 
B=B8' 
6. The number of binary trees with n vertices 


Correspondingly, we find that C3,1(n) = Se (°”), the n-th Fuss-Catalan 
number with a = 3, is: 


— 


The number of slicings of a (2n + 2)-polygon into n quadrilaterals 

2. The number of meaningful parenthesizations in the product of 2n + 1 
elements ay * G2 *+++* G@2n41, Where * is a ternary operation (elements are 
multiplied three at a time) 
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3. The number of paths with (1,0) or (0,1) moves from (0,0) to (2n,n) which 
start with a (1,0) move and do not cross the line x = 2y 

4. The number of finite sequences of length 3n, (a;)1<i<an, with terms a; € 
{1,—2} and nonnegative partial sums )y*_, a; and 373", a; = 0 

5. The number of non-crossing partitions of [2n] into blocks all of even car- 
dinality 

6. The number of planar ternary trees with 2n + 1 vertices 


The point of this section is to present some generalizations of Catalan 
numbers with respect to Lucas sequences and some recent work on these 
generalizations. Clearly, there is much yet to discover. 


5.5.2 Definition and Integrality 


Given a nondegenerate fundamental Lucas sequence U = U(P,Q) and inte- 
gers a > 2 and r > 1, it is easy and tempting to define generalized Lucasno- 
mial Fuss-Catalan numbers Cy,-(m) as 


U, oil U, 
Cast) = an+r _ an+r (5.44) 
U(a—1)n+r n U Uantr n U 


in algebraic analogy to (5.43). If r = 1, then we obtain the Lucasnomial 
Fuss-Catalan numbers 


1 an 
Cu,a(n) = Cu,a,1(7) = Oia ( a ) . (5.45) 
a—1)n U 


The Lucasnomial Catalan numbers Cy(n) := Cy,2,1(n) are defined by 


Cie Gr (5.46) 


Un41 n 


When U is the Fibonacci sequence F’, then they are known as Fibonomial 
Catalan numbers. Here as in other instances in the theory of Lucas sequences, 
the discovery of these Lucasnomial Catalan generalizations came from two 
independent sources: the Fibonacci and the q-numbers. On the Fibonacci 
side, the discovery seems to date to the early 1970s when Gould [36] listed 
the first 50 Fibonomial Catalan numbers. Gould [35] gave a proof of their 
integrality. This proof would in fact carry over to all Lucasnomial Catalan 
numbers for U aregular sequence. Actually, replacing n and k by, respectively, 
2n and n in Equation (5.13), one obtains 


2n 2n—-1 2n—-—1 
( ) =Unin( ) = QU -( ) , 
niu n u n—-1 U 
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which, after division by U,41, is readily seen to yield 


caoe(™ eC) om 


as noted in [25]. Thus, Lucasnomial Catalan numbers are integers with no 
restrictive hypotheses on U. Lucasnomial Fuss-Catalan numbers are also in- 
tegers for all choices of U; see [13, Thm. 6]. A combinatorial interpretation 
of Lucasnomial Fuss-Catalan numbers, valid for all choices of U, was found 
recently [19], as we will see in Sect.5.5.2. Incidentally, this interpretation 
provides a second integrality proof. Generalized Lucasnomial Fuss-Catalan 
numbers were shown to have a nonnegative p-adic valuation for all regular 
primes p in [13, Thm. 9] and so to be integers for U regular. Their valuation 
with respect to a special prime p was shown to be nonnegative a little later 
[15, Thm. 6]. Hence, generalized Lucasnomial Fuss-Catalan numbers are al- 
ways integers as well. However, to this day, no combinatorial interpretation 
for these numbers is known. 


On the q-number side, several g-generalizations of the Catalan numbers 
were discovered and studied in the 1960s and 1970s by MacMahon, Carlitz, 
and Pélya in particular (see [30] for a list of references). They were called 
g-Catalan numbers. Further encompassing generalizations [30] were proposed 
and given combinatorial interpretations. One may also consult Exercise A43 
(76, p. 121]. 


The Carlitz g-Catalan numbers c,(n) derive very naturally from both the 
interpretation (5.36) of Gaussian coefficients and the interpretation 3. of the 
Catalan numbers stated in Sect. 5.5.1. They satisfy 


eg(n) = > g@®), (5.48) 


PECn 


where C,, is the set of Catalan paths, i.e., paths from (0,0) to (n,n) that 
remain below the segment joining (0,0) to (n,n). Here, a(p) is the area sub- 
tended by the path p. Actually, they were given the equivalent interpretation 


ana gree, (5.49) 


wEeWn 


where W,, is the set of binary words w = w w2-:+: Wen of length 2n such 
that the number of 1s never exceeds the number of Os in any prefix of the 
word w and inv(w) is the number of inversions, i.e., of (i,j)s, i < j, such 
that w; > w;. Each such word is in bijection with a Catalan path if a 0 
corresponds to a horizontal edge and a 1 to a vertical edge. A moment of 
thought and one sees that the area under a path is also the sum of the row 
areas east of vertical edges, which is what inversions measure. 
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Another natural qg-generalization of Catalan numbers is obtained from 
(5.46) on replacing U,, by the g-number 4 =. This generalization was traced 
back to MacMahon [57, Vol. 2, p. 214] in the paper [30]. The MacMahon 


q-Catalan numbers C,(n) had the following combinatorial interpretation: 


Cii= 5 ea, (5.50) 


wEeWn 


where W,, is the set of binary words w = w 1w2--- Wen as defined above and 
maj(w), the major index of w, is the sum iieteseaadd i. For instance, 


6@= >: (3) eal) le ee 


3 qg 


e=Dig=1) @ri@—1) 
=(P-qtiI(Pgt+P+Pt+qtHY=C4+P4+ P+ P+. 


The five sequences in W3 contribute, respectively, 


000111 0] 


qd 
001011 = q° 
001101 q 
010011 =? 
010101 q?t*=¢° 


Remark 29 The number C,(n) is also a sum of weighted paths. Indeed, the 
interpretation of C,(n) using the major index has an isomorphic weighted 
path interpretation when, in place of W,, we return to the set of Catalan 
paths C,,. Define the weight of a Catalan path as the product of the weights 
of all its edges, where a horizontal edge of height h has weight q” but only if 
that edge is immediately followed by a vertical edge. Similarly, the weight of 
a vertical edge of abscissa d is q%, if it is immediately preceded by a horizontal 
edge and if d < n. All other edges have weight 1. Then 


C,(n) = S- w(p), 
pECn 
where w(p) is the weight of the path p. 
Remark 30 The identity (5.47) applied to g-binomial coefficients yields 


2n—1 2n—1 
c= (1), na) 
q q 
With the tiling interpretation of g-binomial coefficients with red and blue 


square tiles, isomorphic to the area-weighted interpretation, the above iden- 
tity gives a combinatorial interpretation of C,(n) as tiling weights of 1 x 2n 
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boards with n+ 1 blue tiles. It is the weight of all such tilings that start with 
a blue tile minus the weight of all those tilings such that the last blue tile is 
immediately preceded by a red tile. 


It would be worth examining whether the various g-Catalan numbers and 
their interpretations could find extensions in the context of Lucasnomials. 


5.5.3 Interpretation of Lucasnomial Catalan Numbers 


The only existing combinatorial interpretation of Lucasnomial Catalan num- 
bers Cy(n) to date [19] is based on a small alteration of the interpretation of 
Lucasnomials obtained in (5.35) at the end of Sect. 5.4.1. We now describe it 
with the notation and terminology we had used to obtain (5.35). 


Consider the region S2, and a tiling T of Sg,. Assume k = n— 1. Let p 
be the path from (n — 1,0) to (0,2n) associated with T. Then B = Br is 
the partial tiling obtained from T by erasing tiles right of NI steps and left 
of NL steps. A subtiling C of T, slightly modified from B, is now defined. 
The Catalan partial tiling C is identical to B in all rows except for the initial 
row of length 2n — 1. The first row of C is left fully untiled if the initial step 
of p is northward, i.e., if it is an NI step. So compared to B, the tiling C 
has an additional string of n — 1 empty cells left of the NI step. If the initial 
step is westward, yielding an NL step, then C only retains the domino east 
of that NL step, thus leaving an extra string of (2n —1)—(n—2)-2=n-1 
blank cells to the right of the domino compared with B. Note in passing that 
C = Cr is sufficient to re-trace the path p associated with T. By (5.32), 
w(B) = w(C) x w(1 x (n — 1) board) = w(C) - U,. If B is derived from T, 
then w(T) = Dn-1-w(B). Since |S2n| = 10, pep W(B)- Dn-1, we find that 


|Son| 1 
fet = a(n), = 


Hence, as the Lucasnomial Catalan number Cy(n) = Ton ar is equal to 
r Ge we see that 


n)=S°S w(C), (5.51) 
pC 
where the outer sum is over all tilable paths p from (n — 1,0) to (0,2n) and 


the inner sum over all Catalan partial tilings C associated with p. 


Remark 31 The above interpretation is applicable to explaining combinato- 
rially the identity (5.47); see [19, Prop. 4.3]. 
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Extending the idea presented above, i.e., by defining partial tilings from 
paths in S(41)n,; the authors [19] also provided a combinatorial interpretation 
of Lucasnomial Fuss-Catalan numbers 


Ce dh= 1 Grae 


Van+1 nr 


5.5.4 The Search for Catalan-Like Triples 


Given a nondegenerate Lucas sequence U = U(P,Q) and integers a > 2 and 
k, define Du,a,~ as the set of positive integers n such that U(a—1)n+4x% divides 
(oss The complementary set of Dy,a,, in the positive integers is denoted 
by Du.a,x- Because for all integers n, |Un(P, Q)| = |Un(—P, Q)|, the two sets 
Dup,Q),a,k 82d Dy(—P,Q),a,k are identical. Hence, we may and will assume 
that P > 0. 

If k = 1, then since Lucasnomial Fuss-Catalan numbers (5.45) are integers, 
Du,a,1 = N. Are there other triples (U, a,k) with k # 1 for which Dyu,a,, = N? 
Such triples are called Catalan-like triples for obvious reason. If U = U(2,1) = 
I, the sequence of natural numbers, then there are no Catalan-like triples of 
type (I,a,k) with k 4 1. Pomerance [69, Thm. 1] showed that Dy» is 
infinite when k # 1 with a marked difference in the size of D7, depending 
on whether k <Q ork > 1. If k <0, then Dy2, has an upper asymptotic 
density less than 0.25135. If k > 1, then D;2;, has asymptotic density 1; see 
(69, Thm. 2]. 

A subset S of the positive integers has an asymptotic density equal to 
d = d(S) € [0,1] if a certain limit exists and is d, i.e., if 

#5(N) 


lim ———=d 
N-0o N 


o] 


where #:5(V) is the number of integers n in S within the interval [1, N]. The 
upper asymptotic density of S, 5(S), is the upper limit, or the limsup (or 
lim), of the sequence (#4$(n)/n). The lower asymptotic density of 9 is given 
by the liminf of (#S(n)/n). 

Pomerance [69, Thm. 3] had shown that if k < 0, then D; 2.x is infinite, 
but small with 6(Dr2.%) < 1—log2, a bound that was lowered to 1 — log 2 — 
0.05551 = 0.25134 by Sanna [72]. The argument of Pomerance was extended 
to all a > 2 [13, Thm. 15] to show that for k < 0, 6(Dra.k) < 1-—log2. It 
follows that for k < 0, Dias has a positive lower asymptotic density and, 
in particular, is infinite. The set Dar is also infinite for all a > 2 and all 
k > 2 by [13, Prop. 29]. It is thought that D7.«,, is infinite for all a > 3 when 
k <0. A partial result in this direction is obtained in [13, Thm. 16]. We point 
out that although we know Dy,20, the set of n > 1 such that n divides the 


central binomial coefficient eae is infinite, and has small upper asymptotic 
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density, it is not known whether it has a positive lower asymptotic density; 
see the discussion in [13, p. 17]. 

Moreover, Ballot [13, Thm. 37] showed that for U(P,@Q) any regular se- 
quence and for any a > 2, Dy.a,, has asymptotic density 1 for all k > 1. On 
the other hand, he showed [13, Thm. 14] that for k < 0 and U regular and 
standard (P? 4 4Q), the set Dy.a., is finite. Hence, the only hope to find 
Catalan-like triples for U regular and k £ 1 must occur for k > 2. 

For U(P, Q) regular, all Catalan-like triples are known. There are precisely 
five such triples with P > 0. All are of the shape 


(U,a,k) = (U(1, Q), a, 2). 


The first one found is (U(1, 2), 2,2). It was shown in [12, Thm. 3.5] to be the 
only example with a = 2. The others were determined in [13, Thm. 30]; two 
occur with a = 4 (Q = 2 and Q = 3): one for a = 6 and Q = 5 and the 
remaining one for a = 12 and Q = 2. The regularity hypothesis was made 
because the search relied on the classification [1, 20] of all regular Lucas 
sequences U with terms U,, lacking a primitive prime divisor; see also our 
Sect. 4.5 on primitive divisors. It is an important theorem of [20] that if U is 
regular, all terms U,, have a primitive prime divisor if n > 30. The sequence 
U(1, 2) is outstanding in this respect because it is the only regular sequence 
having defectivet terms U, with n > 13. If it were not for this sequence, 
the theorem on primitive prime divisors would hold for n > 12 rather than 
n > 30. The question of finding some combinatorial interpretation for the 
numbers 7+ ear for U = U(1,2), arises, as it does for the four other 
n+2 

Catalan-like triples. 

It was discovered [13, Thm. 38] that given any triple (U,a,k), U regular, 
a> 2, and k > 1, there exists a minimal positive integer m = m(U, a, k) such 
that for all n > 1, 


m an ' : 
—_ ( ) is an integer. 
Ua—1)n+k nu 


We say the triple (U,a,k) belongs to m. Thus, all triples with k = 1 and 
the five Catalan-like triples belong to m = 1. It was found that the triples 
(F, 2,2) and (F,3,2), resp., belong to m = 2 and m = 3; see [12, Prop. 3.2 & 
4.2] and [13, Rmk. 19]. 

Integrality proofs used in the papers [12, 13, 69] are arithmetic: They show 
that for all primes p, the p-adic valuation of say C); a or ie is at 
least equal to the p-adic valuation of, resp., 2 +1, Un41, or Fon+2. The p-adic 
valuation of the terms of Lucas sequences was studied in Sects. 2.4 and 2.5 and 
for special primes in Sect. 2.12. The p-adic valuation of binomial coefficients 


+ Un is defective if it does not have a primitive prime divisor not dividing D; this notion is 
entirely different from that of a defective integer m with respect to a sequence X defined 
in Sect. 4.6. 
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and Lucasnomials in general is the object of the next section. In particular, 
a generalized Kummer rule is available to determine the p-adic valuation of 
Lucasnomials. 


5.6 The p-Adic Valuation of Lucasnomials 


The p-adic valuation of Lucasnomials oo) y: p aregular prime, is amenable 


to generalized Kummer rules on the one hand. On the other hand, several 
papers have determined the p-adic valuation of some types of Lucasnomials, 
mostly Fibonomials, in direct ways. 


5.6.1 A Generalized Kummer Rule 


We first recall and re-prove Kummer’s rule [53]. 

Suppose p is a prime number. The Kummer rule says that the p-adic 
valuation of the binomial coefficient ) is the number of carries that occur 
when adding m and n in base p. For example, to find out the value of v3 eae 
we execute the base-3 addition of 24 and 5. Since 24 = 2-9+2-3 = (2,2,0)s3 
and 5 = 1-3+2-1 = (0,1,2)3, this addition generates two carries. Thus, 
3? || (2). Indeed, (7) = 3?-5-7-13- 29. 

Let us prove the Kummer rule. Suppose the base-p expansions of m and 
n are 


m= mep’ + mip? 1 +++» +mo, 


n= nep! +ne-ip’ | +--+ +10, 


where all p-ary digits of m and n, m; and n,;, belong to [0, p—1], and mg+nz > 
0. Let i > 0. Then, as one performs, in successive left-to-right columns, the 
base-p addition of m and n, a nonzero carry c; = 1 occurs between columns 
iandit+1 iff (mo +mip +--+ mip) +(no+nipt:--+nip') > ptt. That 
is, 

m n 


On the other hand, using the Legendre formula v,(N!) = 575, eels we find 
that " 
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For two real numbers x and y, the expression |x + y| — |#]| — |y] is either 
0 or 1. It is 1 if and only if the sum of the fractional parts {x} + {y} > 1. 
Hence, 


mtn m n m n 
ba | [aa | a | oe (pat + ar} 2. 


It follows from (5.52) that the p-adic valuation of (""'") corresponds to the 


n 
number of carries c;, 7 > 0, in the base-p addition of m and n. 


Ward [79] uses a similar reasoning to prove that generalized binomial coef- 
ficients with respect to a strongly divisible sequence X = (x,,)n>1 are always 
integers. Define a sequence X of integers to be regularly divisible if for any 
prime p and integer a > 1 such that p* divides some term zy, n > 1, p* | x4 
iff pa | t, where pa := p(p*) is the rank of p* in X. If some power p* of p 
does not divide any term of X, then we set pg = oo. Ward first proves that 
a divisible sequence X is strongly divisible if and only if it is regularly di- 
visible. Legendre’s formula becomes Vp(%n2%n—1°--%1) = 434 [n/pa]. Since 

ag is an integer iff vp("'”) 
follows by observing that 


y 2 9 for all primes p, their integrality 


Pa Pa Pa 
for all p and a>l. 


Knuth and Wilf [52] expanded on the ideas of both Kummer and Ward. 
In fact, they prove that a sequence X of positive integers is strongly divisible 
iff X is regularly divisible [52, Prop. 2]. Essentially, the small difference here 
with Ward’s result is that the equivalence is proved without assuming X to 
be divisible. Knuth and Wilf give Kummer rules for Gaussian coefficients 
and Fibonomials. We derive below a Kummer rule for general Lucasnomials 
which is an easy adaptation [11, Sect. 4] of the rule for Fibonomials. 


Theorem 82 (The Kummer rule for Lucasnomials) Let U(P,Q) be a 
nondegenerate fundamental Lucas sequence and p{ Q a prime of rank p and 
rank exponent v. If p is odd or if PQ is even, then the p-adic valuation of the 
Lucasnomial eeu is equal to the number of carries across or to the left 
of the radix point when adding m/p and n/p in base p, where a carry across 
the radiz point has weight v instead of 1. If p = 2 and PQ is odd, then the 
same counting applies except that the weight of a carry between the first and 
the second place left of the radix point is v2(P? — 3Q) instead of 1. 


Remark 32 An alternate wording of Theorem 82 is possible carrying out the 
addition of m and n in the mixed base (...,p°~!p,..., pp, p,1). Thus, m = 
mep’—tp+-+++mp+mo, where 0 < mo < p and 0 < m; <p fori> 1. The 
advantage is that there are no fractional parts, but the disadvantage is that 
we are usually less familiar with the use of mixed bases. Of course, a carry 
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emanating from the initial column is worth v, and one from the second to 
the third column is 1 unless p = 2 and PQ is odd when it is v2(P? — 3Q). 


Proof Suppose first p is odd or p = 2 and PQ even. Then with the nota- 
tion m!ly = UmnUm_1-::U1, we have, by Theorem 8, a generalized Legendre 


formula 
vp(mly) =~ Fal (5.53) 


1 Lelp’) 


Thus, using Corollaries 11 and 15, we find that 


vabmtu) =S-|™] 4 > 7 


4t—V 
i=1 i>v4+l PP 


“FZ Lal 


5S a eg eile) 


eer | Led Les). 


i>0 


Hence, 


The first term on the RHS above is either 0 or v. It is v iff {m/p}+ {n/p} > 1, 
ie., iff a carry occurs over the radix point in the addition of m/p and n/p in 
base p. As in the proof of the Kummer rule for binomial coefficients, a carry 
occurs between columns i and i+ 1 (¢ > 0) in the addition of m/p and n/p 
iff { mie} +{ we} > 1, which happens whenever 


ell [3 = 


A slight hiccup in the above argument occurs when p = 2 and PQ is odd. 
This is explained by Remark 16. Here, p = 3. Putting ¢ = v2(P? — 3Q) and 
observing that p(2') = 6 for all i, v+1<i<v-+ce, we see that 


vin = >| F]+ 2 [Fl+ Slats] 


i=1 ti=v4+1 i>v+c4+l 


rls] rele] +S bea 


i>l 


Hence, 
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(" + my 
V2 
h 7 
= m+n m n sis m+n m n 
- 3 3 3 6 6 6 


+E (See | - [are |- [3s], 


i>0 


mtn | _|t|_|2%| is 1 precisely when a carry occurs between 


The quantity | 3 
the first and the second digits left of the radix point in the addition of m/3 


and n/3 in base 2. Its weight is c. 


Example 83 Suppose we wish to know the 2-adic valuation of the Fibono- 
mial ('*) ,. Note that p(2) = 3, v = 1, and 12(P? — 3Q) = 2. A direct 
calculation using Theorems 8 and 14 gives 


18 os Fig: ++ Fig o Pio Fi5 Fis 
TS ep Ap F3Fs 


(base 2). 


Now V2(F\2F\5 Fg) = (1 + V2(F¢)) + (0 + V2(F3)) + (0 + V2(F¢)) = 4+ 
1+3 = 8, while 2(F3Fs) = 1+ 3 = 4. Thus, 12(’9),, is the difference 
8—4 = 4. Using Theorem 82, we need to add 11/3 and 7/3 in base 2. 
Since 11/3 = (1,0,0,1)2 + 2/3 and 7/3 = (0,0,1,0)2 + 1/3, we see that 
there is a carry across the radix point as 2/3 + 1/3 = 1 > 1 which creates 
another two in the first two places left of this point. So we find three carries 
with the middle one worth 2. Alternatively, using Remark 32, in the mixed 
base (...,pp,p,1) = (...,6,3,1), we have 11 = 6+3+42 = (1,1,2) and 
7 =6+1=(1,0,1). There are again three carries, and the second one has a 
weight of 2. 


Application 84 Let us use an arithmetic argument to prove that the Fi- 
bonomial Catalan number C'r(n) = ra ‘ear (n > 0) is an integer. If Cr(n) 
is a non-integral rational number, there must exist a prime p for which its 
valuation is negative. Since 7”) p 1S an integer, p must divide F,,41. Thus, 
by Theorem 8, there is a positive integer \ such that n +1 = App, where 


p = p(p), e > 0, and pf A. By Lemmas 11 and 13, we see that 


(Fast) vt+e, if p is odd; 

V, n = ; 

es v+((e—1)+c)-le>1], ifp=2, 

where again v is the rank exponent of p and here, since F = U(1,-1), 

c = ¥2(P? — 3Q) = 12(4) = 2. To find out the p-adic valuation of (*") ,, we 

add n/p to itself in base p and count carries according to Theorem 82. Since 

1 —1 

es ee ene | eed 
p p p 
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we see that the e least significant p-ary digits of Ap® —1 are all p—1 and that 


the fractional part of n/p is (9 — 1)/p > 1/2. Hence, this addition produces 
at least e + 1 carries. By Theorem 82, these carries imply that 


2n y+e, if p is odd; 
Vp > ; 
nr) p v+((e—1)4+c)-[e>1], ifp=2. 


Hence, vp (1s > Vp(Fn41) and vp)(Cr(n)) > 0. This is a contradiction. 


Remark 38 The same argument as in the above application would show that 
the p-adic valuation of a Lucasnomial Catalan number Cy(n) is nonnegative 
ifp{ Q. Ifp| Q and pt P, then by Lemma 6, no term of U is divisible by p. In 
this case, vp(Cy(n)) = 0. It remains to evaluate v,(Cy(n)) when p is a special 
prime, i.e., when p | gcd(P, Q). When p is special, then Theorem 46 describes 
v,(U,,) for all n > 1. It is then possible to evaluate the p-adic valuation of 
any generalized Lucasnomial Fuss-Catalan number Cy.q,,(n); see (5.44) for 
their definition. It was shown in the proof of [15, Thm. 6] that this valuation 
is always nonnegative. In particular, vp(Cu(n)) > 0 if p | ged(P, Q). 


There is no Kummer rule for evaluating the p-adic valuation of Lucasno- 
mials when p is special, so an explicit description was given in [15, Thm. 
1]. The main ingredient of the proof was Theorem 46. We restate the result 
below. 

Suppose U = U(P,Q), where P = p*P’, Q = pQ’, p is a prime, a and b 
are positive, and p { P’Q’. Let m and n be positive integers and r,, and ry 
be the respective remainders of the Euclidean divisions of m and of n by 2p. 
Then the next theorem holds. 


Theorem 85 The p-adic valuation of the Lucasnomial war is 
amn + v,(™*") yr [b= 2a], if b > 2a; 
m+n 
BL | tp 2)1) + (a+ vp(@E4)) [Pal tel: (Pal, afb < 2a, 


where U' is the Lucas sequence U(P’,Q’), c! = v,(P”? — Q'), [-] denotes as 
usual the Iverson symbol, and P, and P2 are the following conditions: 


e Pi: mandn are odd. 
e Po: 2<p<3and2a=b+4+1 andry+ry > 2p. 

Note that v, ees y can be evaluated using the Kummer rule for Lucas- 
nomials as p { Q’. Concrete examples of calculations using Theorem 85 can 
be found in [15, Sect. 3]. 


5.6.2 A Generalized Legendre Formula 


We provide a generalization to Lucas factorials n!y of the Legendre formula 


138 5 Some Properties of Lucasnomials 


vp(n!) = >> Fa a (5.54) 


k>1 p-l 


where s,(n) is the sum of the p-ary digits of n. 


Theorem 86 Let p{Q be a prime. If p = 2, we assume additionally that P 
is even or Q =1 (mod 4). Suppose p has rank p and rank exponent v in U; 
then, for alln > 1, we have 


eas | ety |" , Pln/e| = sel [n/o]) (6.88) 


b> LPk p p= 


where pp is the rank of p* in U and Sp 18 the base-p sum-of-digit function. 


Proof Suppose n = agp’—!p+: --+a,p+aq, where 0 < ap < pandO0 <a; <p 
for 2 > 1. Then, as seen at the beginning of the proof of Theorem 82, 


vow lo] 


k>1 ppk 
= Lae 


But, as [n/pp*| = i=), aiaip'*, we find that 


a eae 


k>0 pp* k=0 i=k 
e- i e-1 i 
= Qi41 be ae = S- Qi+1 S- p. 
i=0 k=0 i=0 u=0 
f— i+] 
pr-l 1 
= 2a a= 1 = 5a eln/el — sp(ln/el)), 


which proves the result. 


Note that if U = U(2,1), then for all primes p, we get p = p andv = 1. 
Moreover, ifn = ae ajp', then p|n/p| = n—ao and s,(|n/p}|) = 8p(n)—ao. 
Hence, if we apply (5.55) to U(2,1), we recover (5.54). 

If p = 2, P is odd, and Q = —1 (mod 4), then c = 12(P? — 3Q) > 2, and, 
as observed in Remark 16, the law of repetition applies not from p = 3, but 
from pp = 6. That is, V2(Uen) = V2(Ug) + V2(n) and V2(Ug) = V2(U3) =c>1. 
As seen at the end of the proof of Theorem 82, we have 


va(ntu) =» | + (¢ 0/7 >> Foal 


k>0 
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Since P is odd and Q = —1 (mod 4), U3; = P? — Q = 2 (mod 4) so that 
y = 1. If n = ap +413+426+a312+---+a23-2°-1, then 7,4 [n/(6-2*)| = 
ar = a4422'—*. Inverting the order of summation as we did in the proof 
of Theorem 86, we obtain the following complement to Theorem 86, namely, 
the equation 


va(nly) = H iif) H ~ (o/s 7 s2((n/3)) 


-e-|a}e=n[g|-s([3)}) 


For example, using (5.56) for F = U(1,—1), we find that c = 2 and 


V2(100!7) = 2 x 33 +1 x 16 — so(32+1) = 66 + 16 — 2 = 80. 


Remark 84 The two formulas (5.55) and (5.56) should coincide if p = 2, P is 
odd, and Q = 1 (mod 4). In this case, because c = 1, both analyses are valid 
provided we take into account that v = v2(P? — Q) > 2. Thus, they coincide 
if we replace the coefficient of |n/3| by v + 1 instead of 2 in (5.56). 


Remark 35 Tf for the Fibonacci sequence, the generalized Legendre formula 
(5.56) yields 
vo(nlp) = 2[n/3] + |n/6] — 82([n/3]), 


Phunphayap and Pongsriiam [67, eq. (9)] obtained another expression by a 
direct calculation based on the 2-adic valuation of Fibonacci terms, namely, 


V2(nle) = [(n+ 3)/6| + 3|n/6| + vo([n/6]!). 


5.6.3 Explicit Valuations 


As seen in Example 83, calculating the valuation of a Lucasnomial can be 
done directly or by using the Kummer rule for Lucasnomials. 

Several papers [58-61, 67, 68] have studied the p-adic valuation of some 
Fibonomial coefficients in a direct manner. We note in passing that, using 
similar methods, results for general Lucasnomials can surely be achieved. The 
paper [11] used the Kummer rule (Theorem 82) to improve upon Fibonomial 
results of the papers [58-61], but also included valuation theorems for general 
Lucasnomials. The proofs were often much shorter. However, the papers [67, 
68] established the p-adic valuation of larger collections of Fibonomials than 
in any preceding papers using direct elementary methods. 

For n > 1 an integer and p a prime, we have (2 ”) = oC Hence, p 
divides (*”) for all n > 1. It is easy to see that p | (°”) for all n > 1 if and only 
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if p | s. Marques et al., who initiated these studies, posed two corresponding 
Fibonomial questions: 


1. Given a prime p, when does p divide (?”) ,,? 
2. What necessary and sufficient condition must s satisfy in order for p to 
divide all Fibonomials of the type (*") ,,? 


In [58], the authors proved that 2 | ar iff n > 2. It is re-proved in a 
few lines in [11, Thm 3.1] using the Kummer rule. For p = 2, the answer to 
Question 2 is that 6 | s. The case p = 3 is solved in [59]. The prime 3 divides 
ee iff n > 2 and n £ 2-3", (k > 0). The set of exceptions is slim and 
is made to appear naturally by using the Kummer rule in a short proof [11, 
Thm. 1.1]. As for Question 2, s must be divisible by 12 [59, Thm. 2]. The 
case p = 5 is peculiar since v5(F,) = s(n) for all n > 1. Thus, (") and 
ie p have the same 5-valuation. In particular, Question 1 has an affirmative 
answer for all n > 1. The use of the Kummer rule was tested on the case 
p = 7 in [11, Thm. 3.6], but the proof was notably longer. Although the set of 
integers n for which 7 { Cy p Was explicitly described, it necessitated three 
parameters. Instead of going further along this method, it was shown that, 
given a prime p, the set of integers n < x for which p { Cle is O(log* x), 
where s is a nonnegative integer < p—2. In other words, divisibility of (””) F 
by p occurs on a set of asymptotic density 1. The remark was made that 
the same result holds if F' is replaced by any Lucas sequence U. Marques 
et al. [61] took n = p*, (a > 1), and found that if p = +2 (mod 5), then 
p divides (°") .,; they conjectured p { (*") ,, whenever p = +1 (mod 5). The 
conjecture was proved in [60], but also in [11, Thm. 2.1] with a short proof that 
used the Kummer rule. In [60], the authors also found the exact valuation of 


Cade for p = +2 (mod 5). (Their result is only accurate for primes whose 
rank exponent is 1 or for even exponents a.) Theorem 7.1 of [11] gave the 
complete p-adic valuation of the Lucasnomials Ca. for all U = U(P,Q) 
and all primes pt Q, 0 < a < b. This was done using the Kummer rule for 
Lucasnomials. Question 2 was addressed not only for the Fibonacci case but 
for any Lucas sequence U in [11, Thm. 6.1]: p divides le for alln > 1 iff 
lem(p, p) | s, where p is the rank of p in U. Thus, if p+ D = P? — 4Q, then 
the condition on s is that pp divides s. 

Phunphayap and Pongsriiam [67] managed by direct elementary methods 
to find explicit p-adic valuation formulas for all Fibonomials of the form 


b 
(ele where b > a > 1 and ¢, and fy are positive integers. Assume 


€,p? > lap%, and define s and r as the respective remainders of the Euclidean 
divisions of £;p° and fp" by p, the rank of p. Theorem 13 of [67] gives for 
primes p = +2 (mod 5) six separate formulas depending on the relative sizes 
of r and s and on the divisibility of @; and @2 by p. But for primes p = +1 
(mod 5), they find a single formula, which we give here 
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Vp oo = Vp ) + [r<s]-(a+v+v,(m—k)), (5.57) 


op* 
where m = [ae], k= nae and vy is the rank exponent of p in F’. It was 
shown [11, Thm. 7.1] that the p-adic valuation of a Lucasnomial of the shape 


(2), (0 > a@ > 0), is O if p | p—1 (and, incidentally, if b = a (mod 2), 
when p | p+ 1). Suppose U = F’, p = +1 (mod 5), and 0 < a < b. Then 
p|p—1 by Theorem 9. Thus, p divides p* — 1 and p® — 1. Hence, r = s = 1. 
Choosing ¢; = £2 = 1, we see that k = 0. Therefore, we indeed recover from 


Equation (5.57) the fact that v, ae = 0. Actually, we see from (5.57) that, 


for p = +1 (mod 5), i(%) = 0 (0 <a <b) as soon as p{ &. Indeed, pt ¢ 
implies r > 1 = s. Moreover, with (2 = 1, we obtain k = 0. 

In [68], given any prime p, the same two authors find formulas for the 
p-adic valuation of the Fibonomials (? Ba p: Where a > 1 is an integer. As an 
example, for p= +2 (mod 5), p ¥ 2, a even, this valuation is 


A a 

p-1 2 
where s = (n (mod p)), A = |m(p* — 1)/p|, and m = n/p’”?™. In some 
corollaries of their results, they are able to determine, given some p”, the set 
of integers n for which p divides ie a p> ® problem which extends Question 
1. For instance, they find that eta 

(1+6-8*)/7,k>0. 

Knowing the p-adic valuation of a Lucasnomial is zero should not prevent 


us from finding its residues modulo powers of p. (See in particular Sect. 5.8). 
In [29], the authors prove a result for specific Fibonomials, namely 


p 18 odd if and only if n is of the form 


pet 
( : ) =1+p+p* (mod p’*), 
Pp F 


for all primes p = +1 (mod 5) whose rank in the Fibonacci sequence is p — 1 
and all a > 3. 


5.7 Lucas’ Congruence 


Let p be a prime and B = (b;,;)o<i,j<p—1 be the p x p matrix of binomial 
coefficient b;,; = ((5) (mod p)). For instance, if p = 3, then 


100 
B=110 
121 
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Then the first 4p lines of Pascal’s triangle modulo p are 


B 
BB 

B (2B) B 

B (3B) (3B) B 


where multiplication is taken modulo p. 


In his classical memoir on Lucas sequences [56, p. 229], Lucas showed how 
constructing the Pascal triangle modulo a prime p lets the initial first p lines 
reappear further down the triangle but flanked with binomial coefficients 
and deduced from this structure the now-famous congruence for binomial 
coefficients modulo p. 


Theorem 87 [fp is a prime number and m and n are two nonnegative in- 
tegers, then Lucas’ congruence 


(2) = (n) te 


holds, where 0 < mini < p are the ith p-ary digits of m and n (see 
Sect. 5.6.1). 


MeStrovié wrote a thorough compendium [63] of the various proofs, ex- 
tensions, and applications of Lucas’ theorem that appeared in the literature 
up until 2014. Section 5 of [63] is dedicated to the generalizations of Lucas’ 
theorem which have to do with generalized binomial coefficients, including 
Lucasnomials. 

These days, the short formal proof of Lucas’ theorem given by Fine [26] 
is perhaps the most often reproduced. It appears on the Wikipedia page on 
Lucas’ theorem or on page 6 of MeStrovic’s paper [63]. It is often noted that 


& ‘ : ) = (") (7) (mod p), (5.58) 


where m, n, s, and t are nonnegative integers with s and ¢ less than p, then 
Theorem 87 holds. Thus, we may also prove Lucas’ congruence by noting 
that 


(L+a)y™ts = ((1+ xP)" (1 +a)? =(14+a2?)"(1+2)° (mod p). 


Thus, if [z*] f(a) denotes the coefficient of z* in the polynomial f(x), because 
s and ¢ are less than p, we see that 
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ert] ta)" = [| + aPy"- fat|(1 +2) 
“(0 tn 
Hence, (5.58) follows. 


There are generalizations of Lucas’ theorem—e.g., modulo higher powers 
of primes—which may themselves admit extensions to Lucasnomials. In this 
section, we limit ourselves to some of the attempts at generalizing the Lu- 
cas congruence (as it is) to Lucasnomials. Our material overlaps much of 
MeStrovié’s [63, Sect.5]. However, the presentation differs, a more uniform 
notation is used, some post-2014 references are cited, and a few remarks and 
improvements on existing results have been added. 

As usual, historically, the first attempts started with g-binomial coefficients 
and Fibonomials. In 1965, Olive [65] obtained the polynomial congruence 


Gor) = (™) or (mod q(q)), (5.59) 


which is a qg-analogue of (5.58), where d is a positive integer and ®4(q) the 
dth cyclotomic polynomial (mentioned in Sects.3.5 and 4.3). Note that if d 
is a prime p, then the irreducible polynomial ®,(q) is the usual g-analogue 
of the rational prime p. Also if d = p, we recover (5.58) by letting q tend 
to 1. Thus, (5.59) is a true generalization of Lucas’ congruence. As with 
many Gaussian coefficient congruences, their consequences and applications 
in terms of Lucas sequences await examination. In 1967, Fray [27, Thm. 3.11] 
gave the following generalization of (5.58): 


ei a ). = (™) (:) (mod p), (5.60) 


where q was viewed as a p-integral rational number, i.e., a rational number 
with both numerator and denominator prime to p, p is the order of g modulo 
p, and m,n, s, and t are nonnegative integers with s and t < p. 

The Fibonomial triangle is obtained by replacing the binomial coefficient 
(7) in Pascal’s triangle with the Fibonomial (') pr: Interest in the Fibono- 
mial triangle modulo primes began in the 1960s (see for instance [37]) and 
flourished in the 1990s [23, 24, 38, 74, 82, 83], often with the intention of dis- 
covering an analogue of Lucas’ theorem. This study was frequently limited 
to the primes 2, 3, or 7 or to primes p satisfying some particular condition. 
However, Wilson [83], on the one hand, and Wells [82] and Holte [38], on 
the other, found analogues of (5.58) that worked for general primes. Wilson 
showed that for a prime p not 2, nor 5, we have 


ae) — oy (7) mies (mod p), (5.61) 


where e = (np +t)(m—n)+n(s—t). 
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In 2001, Hu and Sun [41] found a general Lucasnomial extension of Lucas’ 
theorem that subsumes several of the previous results, in particular the q- 
analogue (5.60) of Fray and the Fibonomial analogue of Wilson (5.61). 


Theorem 88 Suppose U(P,Q) is a regular and nondegenerate Lucas se- 
quence. If q is a positive integer, then 


Gee a. (;) ven (mod P,), (5.62) 


where m, n, s, and t are nonnegative integers, s and t are less than q, e is 
(nq+t)(m—n)+n(s—t), and P, is the largest primitive factor of Ug. If q ore is 
even, then the factor Uj, in (5.62) can be replaced by (—1)t—ns)e-D) Qeq/, 


One easily checks that 


d 2). 5.63 
mt—sn+n(m—n), if ¢ is odd; cca ee) 


oe tie — sn, if g is even; 
By Corollary 39, requiring that U(P, Q) be nondegenerate when P and Q are 
coprime comes down to discarding the two sequences U(+1,1). If (P,Q) = 
(2,1) and q is a prime, then P, = U, = q and Ug4; = q+ 1 =1 (mod q). 
This shows that (5.62) is indeed a generalization of (5.58). It clearly extends 
the result of Wilson, since taking U = F' and q = p, we have p | P,. To see 
that it generalizes (5.60), assume gq is a p-integral rational number of order p 
modulo p. Then we find that 


p+l_ 4 p+1 _ op | P—] 
q _4 qet+4 agrae =1+0=1 (mod p). 


q-1 q-1 q-—1 


Hence, Us,, =1 (mod p). 
If p is a primitive prime factor of Uz, then (5.62) holds modulo p” since 


the rank p of p is q and v is the rank exponent of p. Hence, for all primes 
pt Q, we find that 


mpt+s m\([s é o 
= ‘ 64 
(p22), =(C) Qi ome 
Since 2U,41 = U,Vi + U1 Vp, we see that, for p odd, we also have 
mp+s m\ (s % x 
= 2 : . 
(rere) = CCE) oie (aod 2) (5.65) 


Remark 36 The congruences (5.64) and (5.65) hold without the hypothesis 
of Theorem 88 that U be regular. (The proof of Theorem 88 requires in [41, 
Lemma 4] that U, and U,+1 be coprime. However, if p{ Q, then we know by 
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Lemma 7 that p { gcd(U,,Uq41), which suffices for the congruence to hold 
modulo p, or p”, when q = p(p).) 


Congruence (5.58) is equivalent to Lucas’ theorem, but the existence of 
Lucasnomial generalizations of this congruence does not usually imply the 
wishful generalization of Lucas’ theorem: 


@ o II (™") 5. ete (5.66) 


where the m,; and n, are the digits of m and n in the mixed base described 
in Remark 32. For instance, the Fibonomial or = Fs = 5 = 2 (mod 3), but 


as pp(3) = 4, we have 5 =1-p4+1,1=0-p+4+1, and Glee = 1. However, 
in this regard, we do have a couple of corollaries to Theorem 88. 


Corollary 89 Let U(P,Q) be a fundamental Lucas sequence, and let p{ Q 
be a prime of rank p in U. Then, for all m and n nonnegative integers, we 


have 
p divides (™) <=> p divides II @ . ("*) 
n U i>1 NG no U 


where (m)i>0 and (n;)i>0 are the respective digits of m and n in the mixed 
base (1, 9, pp,..-,p"p,---)- 


Proof By (5.64), we have 


("), i (a Ge p+1 (mod p). 


By Lucas’ theorem, 


(inal) ELC) (mma 


i>l 


The corollary follows because, by Lemma 7, p{ U,p+1 since p | Up. 


From Corollary 89, we easily deduce the following result. 


Corollary 90 Let U(P,Q) be a fundamental Lucas sequence and p { Q a 
prime of rank p > p. Then, for all m and n nonnegative integers, we have 


p divides (”) <=> p divides II co ‘ 
MU iso SU 


where (m,;);>0 and (n;);>0 are the respective digits of m and n in the mixed 
base (1, p,pp,.--,p"p,---)- 
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Proof By Corollary 89, it suffices to check that for each i > 1, p | Ce) 
iff p | (es But, for i > 1, we have 0 < mj,n; < p. Thus, p | (") iff 


m,; <n;. By the Kummer rule for Lucasnomials (Theorem 82), p | (os iff 
either m; < _n;, or m; > nj; and {(m; — n;)/p} + {ni/p} > 1. But the latter 
condition holds iff |m;/p| — |(m; — ni)/p| — |ni/p| = 1 and that cannot 


happen since p > p entails that each of the three integral parts is zero. 


We observe that Corollaries 89 and 90 imply the naive version of Lucas’ 
theorem (5.66) for p = 2, since the rank of 2 in U is either 2 or 3. Thus, if Q 


is odd, then 
m Mj 
= ‘ (mod 2), (5.67) 
("") U II ( ) U 


where as before the m; and n,; are the digits of m and n in the mixed base 


(Dp 20 jnach2 hoe): 
In the proof of Corollary 90, we saw that for i > 1, p | (a implies 


p | Cis This implication holds even if p < p. Thus, with Corollary 89, we 
see that the divisibility of [],., ae by p is a necessary condition for p to 


divide er That is, we at least have the following result: 


Corollary 91 Let U(P,Q) be a fundamental Lucas sequence and p{ Q bea 
prime of rank p. Then, for all m and n nonnegative integers, we have 


p divides (”) = p divides T] (") 
ra iso \"i/U 


where (m,;);>0 and (n;);>0 are the respective digits of m and n in the mixed 
base (1, p,pp,.-.-,p"p,---)- 


The congruence (5.67) appeared in the 2014 paper [23] as Theorem 3.4 at 
least when U is the Fibonacci sequence. The authors checked the congruence 
via a case-by-case use of the Kummer rule for Fibonomials. 

In 2016, Southwick [74] conjectured Corollary 90 again in the context of 
the Fibonacci sequence. In 2018, Debellevue and Kryuchkova [24] worked out 
a direct proof of Southwick’s conjecture using the Kummer rule for Fibono- 
mials; i.e., they did not see it as a consequence of Theorem 88. Their main 
theorem, [24, Thm. 4.6], states that if p is an odd prime of rank p = p+1 in 
the Fibonacci sequence, then 


ee : ‘ ) oo (") oo (mod p), (5.68) 


where a > 0,0< 5 < p%p,0<t< p%p, and 0 < m,n < p. This congruence 
has nice consequences for the structure of the Fibonomial triangle modulo 
such a prime, since blocks of size p* reappear flanked by binomial coefficients 
further down the triangle just as Lucas discovered they did in the case of the 
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Pascal triangle, only up to a possible —1 factor this time. With sharp eyes 
or a magnifying glass, one can see the Fibonomial triangle modulo 7 in [24, 
Fig. 1]. 

We provide next a theorem which shows that the same phenomenon holds 
in broader generality. 


Theorem 92 Suppose U(P,Q) is a fundamental Lucas sequence with Q = 
+1 and p is an odd prime whose rank p is even (p = 2py ). Assume a, m, n, 
5, and t are nonnegative integers with s and t less than p*p. Then 


mp" p+ s m\ (s 
~ = Em,n,s,t (mod P). 
nptp+t/y n}/\t}y 


(-)™", #4|p orQ=1; 
Em,n,s,t = 
mee i if 2||p and Q=-1. 


where 


Proof Suppose first Q = —1. Note that when applying Theorem 88 to the 


Lucasnomial ara with q = p, we may, since Q = —1 and p is even, 


using (5.63), replace Ue, , by (—1)™—8"(—1)°v = (-1) ("#8 (14v)_ Write 
s = s'‘p+s9 and t = t/p + to, where 0 < s0,to < p. Using Hu and Sun’s 
theorem with g = p a couple of times and Lucas’ theorem, we find that 


ip) (eee) (ce 2) (9) ea 
=(NE)@) r= OG), 
=(") (3) Carts (oa x), 
where 


a = ((m+s')to — so(n + t’))(1 + pv) 
y = (s'to — sot’)(1 + py). 


Hence, «+y=((m+s')to — so(n +t’) + (s'to — sot’)) (1+ pv) 
= (mto — son)(1 + py) = (mt — sn)(1+ py) (mod 2), 


where the justification for the latter congruence is that, p being even, t = to 
(mod 2) and s = sq (mod 2). Thus, 


(=1)7*# = sy, if 4 | P; 
1 if 2|| p. 


d 
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If Q = 1 and pis even, then U’,; is replaceable by (—1)™—s”" in Theorem 88. 


Thus, our calculation of (—1)**¥ is identical to the case Q@ = —1 and 4 | p. 


Remark 37 For U = F, the case of an odd prime p of rank p = p+1 
(i.e., congruence (5.68)) is a subcase of “4 | p” in Theorem 92. Indeed, by 
Theorem 36, we must have (Q|p) = —1. Thus, since Q = —1, we have p= 3 
(mod 4). Hence, 4| p+1. Note also that, by Lemma 15, V,41 = 2Q (mod p). 
For these primes, we also recover (5.68) from (5.65). 


When Theorem 92 is applicable, it provides a very effective way of com- 
puting a Lucasnomial modulo p. In fact, we get the following corollary which 
is very close to Lucas’ theorem. 


Corollary 93 Suppose U(P,Q) is a fundamental Lucas sequence with Q = 
+1 and p is an odd prime with even rank p. Assume m, n, and a are non- 
negative integers with a > 1, m = 89 + 81p + Sopp +++: + Sap” ‘p, and 
n=to +tipttopp +--+: +tap*'p, where 0 < s9,to < p and 0 < 8;,t; <p 
for all 1 <i<a. Then 


a ~ I (*') | 6 yfmn (nod p), 


(—1)l™/e]to—soln/e] — (_1)* Lisi 81-80 Visite if4| por Q=1; 
Em,n = 
, 1, if 2||p and Q=-1. 


Proof By Theorem 92, we obtain 


(™) a ee “| 2 Ga (*") nse THOS: 
n}y ln/p|p+to]y n/p] / \to] yu 
and the result follows by applying Lucas’ theorem to the binomial coefficient 


[m/p] 
( [n/ol ) , 


Remark 38 The Lucasnomial triangle modulo p is easiest to construct when 
Q = —1land 2|| p. The initial triangle made of the first ¢ lines, from 0 to p—1, 
is repeated in the successive next blocks of p lines twice, three times, ...,n 
times, where the kth copy in the nth block is multiplied by () (mod p). 
Thus, once the initial triangle of p lines is built, the construction is similar 
and as convenient as for our ordinary Pascal triangle. 


Example 94 (Of the use of Corollary 93 to compute some Lucas- 
nomials modulo p) 


1.1 U =F = U(1,-1), p = 11, then p = 10 and 2||p. Say m = 252 and 
n = 131. Then 


5.7 Lucas’ Congruence 149 


252 =2x110+3x104+1-x 2, 
1381 =1x 110+2x104+1x 1. 


Hence, the mixed-base digits (s2,51,59) are (2,3,2) and (to,t1,to) = 
(1, 2,1). Thus, the Fibonomial modulo 11 is given by Corollary 93 as 


(st) G)G)(Q), atts totan 


2. If U = P = U(2, —1), the sequence of Pell numbers and p = 3, then p = 4. 


Hence, 


08) = (0) caproraoeteanica tte 
P P 


since [50/4] =1-9+1-3, so = 2, [13/4] =1-3 and tp = 1. 


We end the section by giving a few related theorems from the literature. 


Holte [39, Thm. 3] obtained the following curious, somewhat “Lucas-like” 
theorem, where T = T(p) is the least period of U,, modulo p. This theorem 
also allows for the computing of a Lucasnomial modulo a prime. 


Theorem 95 Suppose p{ Q is a prime, m and n are nonnegative integers, 
and A is the maximum of 0 and |rr(m)/p| + [rr(n)/p] — (p—1). Then 


@ 27 = (We + w/e) (Levine ane ee ay, 


n [n/p] Lrr(n)/p| +AM 
rp(m) + rp(n) + AT 
( rp(n) + XL ), ee. 


where p = p(p), T =T(p), rr(x) is the remainder in the division of « by T, 
and M =T/p is the multiplier. 

Remark 89 The multiplier M = M(p) for the Fibonacci sequence can only 
be 1, 2 or 4 (with equal frequency, i.e., each value of M occurs for a set 
of primes whose natural density is 1/3 [6, Thm. 5.1]; see Chap. 9 for prime 
density questions). This allows for a simpler expression of Holte’s theorem 
when U = F. 


Corollary 96 Suppose p # 5 is a prime and m and n are nonnegative inte- 
gers. Then 


’) r Gc, aa : oe ) “A 


ae + ~) . (mod p), 


rp(n) 
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where p = p(p), T = T(p), and rp(x) is the remainder in the division of x by 
T. Moreover, if 2|| p, then 


29), 


Proof By [6, Thm. 5.1] or [78], for p #5 a prime, we find that 


) (mod p). (5.69) 
F 


M(p)=1 <> 2\||por p=2, 
M(p)=2 <> Alp, 
M(p)=4 <> pis odd and p F 2. 


If M =1, then rr(x) < T = pso that A= 0. If M = 2, then rr(a) < T = 2p. 
Hence, [rr(m)/p| + |rr(n)/p| < 2, and since p—1 > 2, A = 0 again. If 
M =4, then [rr(m)/p| + [rr(n)/p| < 3+ 3. But the least prime p 4 5 for 
which M = 4 is p = 18 of rank 7. Since 13 — 1 > 6, we see that \ = 0 once 
more. If 2||p, then rr(a) = 0 for all integers x > 0 so the middle term on 


the RHS of the corollary is me This explains (5.69). 


Example 97 For the sake of comparison, we apply Corollary 96 to the same 
Fibonomial ea p and the same prime p = 11 as used in Example 94. Since 


13 
252\ (252) 131 + 121 
18h) @ M2 it. Je 
let us put m = 131 and n = 121. Since pr(11) = 10, we have 2]|| p, con- 


sequently, by the results in [6], M = 1. (Indeed, Fy; = 89 = 1 (mod 11).) 
Thus, rr(m) = rr(n) = 1. By (5.69) and Lucas’ theorem, we get 


131 +121 13+12\ (2) _ (2-143) | 
1 Jy 12 J, \l-ll+1 
2\ (3 
= (?)(3)-2%3=6 (aod), 


in agreement with our previous calculation. 


Il 


In 2002, Hu [40] published a result similar to Theorem 88, but for gen- 
eralized binomial coefficients with respect to companion Lucas sequences V, 
namely, 


Theorem 98 Suppose U(P,Q) is a regular and nondegenerate Lucas se- 
quence. If q is a positive integer, then 


(ma)(mas) a (™)(*) Cant (mtg), (670 
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where m,n, s, andt are nonnegative integers, m and n are even, s and t are 
less than q, f is (nq+t)(m—n)/2+n(s—t)/2, and V> is the largest factor 
of Vg prime to VoV, --- Vg-1- 


5.8 Wolstenholme’s Congruence 


In 1862, Wolstenholme [84] made a remarkable discovery: if p > 5 is a prime 
number, then the binomial coefficient Ga) is congruent to 1 modulo the 
cube of p. That is, 


& = ‘) =1 (mod p). (5.71) 


It had been observed by Babbage [4] in 1819 that congruence (5.71) held 
modulo p? for p > 3 since 


Gy > Gare 3 (?) = (8) +(2) =2 (ava 2 


Hence, dividing each side of the above congruence by 2 yields that ex =1 


(mod p”). Wolstenholme, in finding (5.71), got the intermediary interesting 
congruence for the harmonic number Hy _1, also valid for p > 5: 


1 1 1 


Ay-1=14 fees + = 
p-1 1 p—-l 


0 (mod p’). (5.72) 


(In [42, p. 93], the authors mentioned a remark of Rao who pointed out that 
congruence (5.72) and some of its extensions had been anticipated by War- 
ing in 1782. Thus, Babbage could have discovered Wolstenholme’s improved 
congruence. ) 

Deriving (5.71) from (5.72) only requires a few steps. Indeed, 


2p —1 > 
ey ~ (p— 1! I]e+9 
=1+pH,itp > 1 (5.73) 


rr ay 
O<i<j<p 


1 
=1+p’ x > (mod p?). 
O0<i<j<p J 


This yields (5.71) provided we verify that )locicj<p : is 0 (mod p). But 


Ha ys z+2 > > 


: ae ed 
O<i<p O0<i<j<p 
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and since p > 5, we have 


b> _ ss a eo Ure 1) 9 (mod p). 


0<i<p O<i<p 


Therefore, we do have )igcicjcp 5 = 0 (mod p). 

Prior to writing a survey on Lucas’ theorem, i.e., on Theorem 87, Mestrovié 
had written yet another large survey on Wolstenholme’s theorem and its 
various generalizations, this time up until 2012 [62]. However, MeStrovic’s 
survey, although it included a final section on g-analogues of Wolstenholme’s 
congruence, did not include the research aimed at finding a generalization of 
the Wolstenholme congruence with respect to Lucasnomials. This is chiefly 
the story we recount in the present section. 


To describe this research, we need to mention two early generalizations of 
Wolstenholme’s congruence, namely, the Wolstenholme-Glaisher congruence 
[31, 32] of 1900, which asserts that for all integers k > 0, 


oe l)p-1 


od ) =1 (mod p’), (5.74) 


and the further 1952 generalization of Brun et al. [21] which states that 


() = ("") (mod p*), (5.75) 


for all nonnegative integers m and n and all primes p > 5. 


Remark 40 Multiplying both sides of (5.74) by k+1, one obtains (5.75) with 
m=k+1 and n= 1. Clearly, (5.75) supersedes the Wolstenholme-Glaisher 
and the Wolstenholme congruences. However, in some sense, the Wolsten- 
holme congruence is the inner core of (5.75): Bailey [5] re-proves the Wol- 
stenholme congruence and uses it as a basis to obtain (5.74) and (5.75) by 
two successive inductive proofs. Similarly, this time by a combinatorial argu- 
ment, Stanley [75, Sol. of exercise 1.14, p. 165] reduces the proof of (5.75) to 
the Wolstenholme congruence (*?) = 2 (mod p®). 


Kimball and Webb sought some analogues and generalizations of these var- 
ious theorems with respect to Fibonomials and Lucasnomials in a series of pa- 
pers in the 1990s [47-50], starting the investigation, as often, with Fibonacci 
numbers. It is curious to observe that parallel and independent researches on 
q-analogues of all these same theorems were simultaneously carried out in the 
1990s and refined further in later years. We refer the reader to MeStrovic’s 
(62, Sect. 11] and only mention, for the sake of comparison with Lucasnomial 
results, the 1999 polynomial congruence of Andrews [2, Thm. 3] 


(MP) = (ME) gnome (mod By(a)?), (6:7) 
NP} q UY gp 
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m 


where p > 3 is a prime, (™ 


Ne is obtained by replacing every appearance of q 
by q? in Ca m and n are nonnegative integers, and ,(q) =1+q+@+ 
-.-+g?—! is the pth cyclotomic polynomial in q. By letting q tend to 1, we see 
that congruence (5.76) is a weak generalization of the Brun et al. congruence, 
since we only get the congruence modulo p?. 

Kimball and Webb started [47] by proving a p?-analogue of (5.75) for the 
Fibonacci sequence. Suppose p > 3 is a prime number and p is the rank of p 
in the Fibonacci sequence F' = U(1,—1). They find that 


("*) = e(m—n)np (™) (mod p?), (5.77) 
np) p n 
where 
8 if 2 || p; 
e=<¢-l, if 4 | p; 


V-1 (mod p”), if p is odd, 


and m > n > 0 are integers. (Note that when (m—n)np is odd, the theorem 
does not say which of the two square roots of —1 (mod p”) to select. For 
instance, if p = 5, then the theorem holds with « = —7, not 7. From (5.82), 
we will see that « = L,/2 (mod p*). Note further that if p is odd, putting 
U = F, V = L, and n = p¢ in (2.8) yields that L2/4 = —1 (mod p’).) 
Another remark: if p is even, then Theorem 92, which is a congruence that 


yields ("?) (mod p), remains valid modulo p? by (5.77). 
np) p 


Our two pioneers,’ Kimball and Webb, in their next paper [48], described 
some analogues of the congruence of Wolstenholme for the harmonic number 
Hy-1 (5.72), for the Wolstenholme-Glaisher congruence, and, under condi- 
tions on the rank of p, for the congruence of Brun et al. modulo p’, all in 
the Fibonacci context: Let F and LE designate, as usual, the sequences of 
Fibonacci and Lucas numbers. If p > 7 is a prime and its rank p in F' is p—1 
or p+1, then 


=! =0 (mod p?). (5.78) 
O0<i<p 


(It is an analogue of (5.72) as we may state (5.72) in the equivalent form 
Vocicp 2/t = 0 (mod p’), since for (P,Q) = (2,1), we have U; = i and 
V; = 2 for all i.) 

Much in the way we deduced Wolstenholme’s congruence (5.71) from Wol- 
stenholme’s divisibility of the harmonic number Hp-1 by p*, Kimball and 
Webb obtain a Fibonacci analogue of the Wolstenholme-Glaisher congruence 
(5.74). Suppose p > 7 is a prime number of rank p = p — ep, where 


+ The term “pioneer” is meant as a true compliment; too often credit only rests on those 
who put the final dot. 
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1, ifp=+1 (mod 5); 
—-1, if p =+2 (mod 5). 


€p = (5|p) = (p|5) 


Then 


k+1)p-1 
(‘ +) ) = & (mod p*). (5.79) 
p—1 F 
Finally, they found that under the same rank condition on jp, i.e., p = p — €p 
and €, = +1, we have for all p > 7, all integers m > n > 0, 
0 ee a (mod p?), (5.80) 
np) p nr) 


where F’ = U(L», (—1)?) = U(Lp, 1). 
They also pointed out a corollary of (5.80), ie., 


e = ie He=P— 1h od py, (5.81) 
F 


nt (one ifp=pth 


where T = T(p) is the least period of F modulo p. 


We remark here that if VU = U(2,1), Lucasnomials with respect to U are 
the usual binomial coefficients. In this case, py(p) = p = T(p). Therefore, 
it makes sense to look for analogues where either p or T replaces p in the 
binomial coefficient ee) of (5.75). 

In [49], Kimball and Webb obtained a generalization of (5.77) for general 
Lucasnomials. Suppose U(P, Q) is a regular nondegenerate Lucas sequence 
and p > 3 is a prime of rank p in U. Then, for all integers m > n > 0, 


ee = (7)! 2yn—nine (mod p?), (5.82) 


npJg \n 


where V is the companion Lucas sequence of U(P, Q). 

We note in passing that if we put s = t = 0 in (5.65), then we obtain 
precisely the above congruence albeit modulo p, rather than p?, since, by 
Theorem 88, e = n(m—n)p. 

From (5.82), they derive several congruences to express (oF ) yy modulo Dp, 
where again T is the minimal period of U modulo p. The case Q@ = +1 is 
particularly simple since 


(2), = (tt) mie 


where M is the multiplier defined in Theorem 95 as the integer T’/p. 

In their final paper on this theme [50], they essentially generalized the 
results they had obtained in [48] in the Fibonacci context to general Lucas- 
nomials in the form of the next theorem. 
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Theorem 99 Suppose U = U(P,Q) is a nondegenerate fundamental Lu- 
cas sequence and p > 5, pt Q, is a prime number of rank p. Define S as 


Vn 
Do<n<p Ut: Then 


1. S=0 (mod p), 
2. D=0 = S=0 (mod 5’), 
3. D#40andp=p+1 — S=0 (mod p’), 


where as usual D denotes the discriminant P? — 4Q. 


For some reason, they did not fully complete the investigation. For one 
would naturally want to know whether extensions of the (mod p*)- congru- 
ences (5.79) and (5.80) which hold for the Fibonacci sequence exist in a more 
general form for Lucasnomials. 

This was in large part done by Shi [73] in 2007. She (Shi) obtained the fol- 
lowing theorem as a generalization of (5.80) and of the Brun et al. congruence 
(5.75). 


Theorem 100 Let U(P,Q) be a regular nondegenerate fundamental Lucas 
sequence. Let p > 5, p{ Q, be a prime number. Suppose the rank of appearance 
p ofp inU isp—1 orp+1. Then, for all nonnegative integers m and n, 


M), 


where U' = U(V,, 0°). 


m 


(-1yemningir-nnG)( ). (mod p*), 


n 


Shi also has a theorem, [73, Thm. 2], that expresses in an elaborate way 


Lucasnomials CG modulo p”, with no restrictive hypotheses on p, the 


rank of p. 

In 2015, Ballot, unaware at the time of Shi’s contribution (of this he is 
aware!), obtained a theorem [10, Thm. 13] identical to Theorem 100 but 
with broader hypotheses: The theorem holds modulo p? (p > 5) without the 
restriction that U be regular and for all primes of mazimal rank, i.e., of rank 
Pp — €p, where €, = (D |p) is allowed to be zero. Moreover, the theorem holds 
modulo p? (p > 3), instead of p°, if no restrictive hypothesis is placed on the 
rank p. Thus, with p > 5, p = pu(p) maximal, and U’ = U(V,, Q°), we find 


that 
("* = ( _ oe ee) (”) (mod p*). (5.83) 
np) y iJ ap 


Incidentally, the generalized Wolstenholme-Glaisher theorem [10, Thms. 3 
and 7] for general Lucasnomials uc P.Q) has the shape 


(‘" a ‘) = He tore (mod p"), (5.84) 
U 
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where p > 3, p{ Q, is a prime number, & > 0 is an integer, and 
a=2+[p>5]-[p is maximal ]. 


We observe that (5.83) is a generalization of Brun’s congruence (5.75), but 
Theorem 100 is not. Indeed, if U = U(2,1), then, for all n > 0, U, =n and 
V, = 2. In particular, U’ = U. Since pyu(p) = p is maximal with e, = 0, 
(5.83) applies. Because Q = 1 and p—1 is even for all primes p > 5, the RHS 
of (5.83) is 1, and we recover Brun’s congruence (5.75). 

More refined congruences than Wolstenholme’s for binomial coefficients 
(2) (or for the Wolstenholme-Glaisher congruence) have been the object 
of many papers. MeStrovié [62, pp. 4-7] has compiled a number of congru- 
ences for Ce) modulo p”, where x varies between 4 and 9. These formulas 
involve powers of p with coefficients which typically are expressed in terms of 
generalized harmonic sums and/or Bernoulli numbers. The interested reader 
may wish to consult two more recent papers of interest in this regard [55, 70], 


which gave methods for obtaining higher power congruences for eer: 


Ballot [10, p. 12] chose three congruences that expressed Cea) modulo 
p? as well as the (mod p®)-congruence of Tauraso [77] 
2p—1\ _ 2p 6 
(Pot) 14a DS p+ = (mod p’), 


all valid for p > 7, to examine their potential Lucasnomial generalizations. 
For instance, Tauraso’s congruence generalizes [10, Thm. 28] as 


2p — Q 4 nites) ap YoU; es Z 
=(-[)“"0-= (142 > he ara e (mod p”), 
€ “lip Vegan 6 oP ween 


where p > 7 is a prime of maximal rank p with respect to a general Lucas 
sequence U(P, Q). 

In 2013, Ohtsuka [64] proposed in the “Advanced Problems and Solutions” 
of the Fibonacci Quarterly (openly available online) to prove that the con- 
gruence 

(‘" 4p ‘) = (-1)F®-D/2 (mod #1, 
p-1 F 

holds for all primes p > 3 and all integers k > 0, where F, and L, are the pth 
Fibonacci and Lucas numbers. Since the congruence is obviously an analogue 
of the Wolstenholme-Glaisher for Fibonomials, this spurred Ballot to find a 
general Lucasnomial analogue in this direction. Thus, if (U,V) is a pair of 
Lucas sequences with parameters P and Q and p > 3 is a prime, then, for all 
k > 0, we have [14, Thm. 3] 


e bua ‘). = QhP-0/2 (mod U2V;/P). (5.85) 
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There is a corresponding Lucasnomial generalization [14, Thm. 17] of congru- 
ence (5.75) of Brun et al., namely, for p > 3, m and n nonnegative integers, 


("”) = Qn(m=n)p(p-1)/2 (") (mod U“V,/P), (5.86) 
np U n U' 


where U’ = U(V,, Q?) anda =2+[p>5]-[U, =+p]. 

The factor U? of the above modulus UV, /P in (5.86) suggests looking for 
a possible upgrading of the modulus of the g-congruence of Andrews (5.76). 
Indeed, U, plays the role of the polynomial &,(q). 


Point 3 of Theorem 99 is not quite a generalization of the Wolstenholme 
congruence (5.72) for H,-1, because the authors did not address the case 
€» = 0 (only the case D = 0). This is at least an analogue of (5.72). A 
true generalization came years later. In 2008, in a too-rare link established 
between the g-theory and the Lucas theory, Pan [66] used a q-analogue of 
(5.72), expressing 777" a modulo @,,,(q)?, to go further and establish that 

TI Vi  (m?-1)D Um 


. d P? : 
o> F V,. (mod P*), (5.87) 


Il 
un 


where m > 5 is an integer, (U,V) is a pair of Lucas sequences with parameters 
P and Q, D = P?—4Q, and, as in Hu and Sun’s Theorem 88, Py, is the largest 
factor of U,, relatively prime to U2U3---Um_1. Note that if p =m > 5 and 
Pp = Pp — & is the rank of a prime p, then p > p—1>4so p> 5, and, in all 
a cases of €, = (D|p) € {0,+1}, the factor (m? — 1)DU,, is zero modulo 
p* so that for all primes p > 5 of maximal rank, we find that 


S- Vn =0 (mod p’). (5.88) 
O<n<p ~” 

Thus, Pan’s result implies (5.72) since (5.88) is a true generalization of (5.72). 
In [8, Chap. 4], it was observed that, for some values of D, there are other pairs 
(X,Y) of second-order recurring sequences with characteristic polynomial 
x? — Pr +Q, for which sums of ratios }>,(X;/Y;), where the sum is over 
intervals of length p on which Y # 0 (mod p), are zero modulo p’, if p = 
pu(p) is maximal. In fact, it was noticed that for all pairs (X,Y) studied 
in [8] for which the Wolstenholme phenomenon occurred, X and Y differed 
by V with respect to the Laxton group law. Ward [81] defined and Laxton 
later used the following terminology: Two second-order linear recurrences 
with the same characteristic polynomial are said to be polar if they differ 
by V in that group. (The polarity relation is symmetric.) It is not true that 
the Wolstenholme congruence holds for all polar pairs of sequences. Thus, it 
would be of interest to describe exactly when the phenomenon takes place. 
(The Laxton group will be described in Chap. 9, Sect. 9.7.) 


158 5 Some Properties of Lucasnomials 


Leudesdorf [54] proved in 1889, the year of the completion of the Eiffel 
Tower in Paris, that if 2 > 1 is an odd integer and p > @+ 4 is a prime 
number, then 

1 
S> = =0 (mod p’). (5.89) 


ne 
O0<n<p 


This is an extension of Wolstenholme’s congruence (5.72) since for ¢ = 1, 
(5.89) says that for all primes p > 5, Hp; =0 (mod p”). Congruence (5.89) 
also appears as Theorem 131 in the classic book of Hardy and Wright [42]. 
A generalization of (5.89) to Lucasnomials was found in 2014 by Ballot [9, 
Thm. 3]. Suppose (U,V) is a pair of Lucas sequences with parameters P and 
Q. If £> 1 is an odd integer and p > £+4, p{ Q, is a prime of maximal rank 
p=P—p, & = (D |p), in U, then 


L 
oy Vn = (mod p”). (5.90) 


In closing this section, we note that Wolstenholme’s congruence for Hp_1 
(5.72) says that the sum of all Egyptian fractions 1/2, for 1 <i <p (4 prime 
to p), is zero modulo p. Leudesdorf [54] also found an extension of (5.72) to 
all composite integers m > 1, not just for m prime. That is, 


Theorem 101 Let m be an integer > 1. Then 


where €m = gcd(m,6), unless m = 2%, a> 1, when em = 4. 


Some work related to generalizations of Theorem 101 for Lucasnomials 
can be found in the papers [7, 43]; see in particular [7, Thm. 3]. 


5.9 A Word on Lehmernomials 


We define here Lehmernomials, or Lehmernomial coefficients, 


as generalized binomial coefficients with respect to a Lehmer sequence 
U(R,Q); see Sect. 4.4 for the definition of the Lehmer sequences U(R, Q) 
and V(R,Q) and a few of their basic properties. 

Much of the theory developed with respect to Lucasnomials in this chapter 
may have a potential comparable theory for Lehmernomials. Here, we merely 
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define them and observe that they are integers and that Lehmernomial Cata- 
lan numbers and Lehmernomial Fuss-Catalan numbers are also integers. 


Definition 8 (Lehmernomial Fuss-Catalan numbers) Let a > 2 and n 
be integers and U(R,Q) be a U-Lehmer sequence. The numbers 


1 (“") 
U(a—1)n41 nd Go 


are called Lehmernomial Catalan numbers if a = 2 and Lehmernomial Fuss- 
Catalan numbers if a > 3 (with respect to U(R, Q)). 


Theorem 102 Lehmernomials are integers. 
Proof By (2.42), we get 
Um (WR, Q) = Unar(VR, QU m—n(VR, Q) — QUn(VR, Q)Um—n—1(VR, Q). 


Suppose m and n are even integers. Then dividing both sides of the above 
equation by VR, we obtain 


Um(R,Q) = Unsi(VR,Q)- Yn-nle Q) ae) 


= On+1(R, Q) : Ue, Q) _ QU, (R, QU en alh, Qs: 


‘ Tis OR, Q) 


For all parity cases of m and n, we obtain similarly 


Un, = Renna man — 2 ee 5 ee (5.91) 
where 
Rn = 1, . mn a even; sail Oi = RQ, if mis fe and n even; 
, R, if mn is odd; Q, otherwise. 


Using Lemma 31, we find that 


7 = =I 
(") = RmnUn+1 (" ) _ QmnUm-n-1 i ) : (5.92) 


Using (5.92), the integrality of Lehmernomials follows by carrying an induc- 
tion on m. 


Theorem 103 Lehmernomial Catalan numbers satisfy 


1 (*") =("~") tae 
Uns1 n 7 n on n—-2 U 


and, consequently, are integers. 
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Proof Note that Rep» = 1 and Qann = Q. The same technique used to 
obtain (5.47) yields the result. 


More generally, we have the following: 


Theorem 104 Lehmernomial Fuss-Catalan numbers satisfy 


1 an an—1 Cea Ayn an—1 
Wish = _ Qan(a-1)n— 2 
(a—1)nt1 \"/G nm U n er U 


and, consequently, are integers. 


Proof Replace m by an and n by (a—1)n in (5.92), note that Ran (a—1)n = 1, 
divide through by U(a—1)n41, and rearrange the last term on the RHS of the 
identity as done in [13, Lemma 5]. Integrality follows from the integrality of 
Lehmernomials and the fact that U(R,Q) is a divisible sequence. 
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Chapter 6 


Cubic Extensions of the Lucas 
Sequences 


I don't think that everyone should become a mathematician, but I do believe that 
many students don’t give mathematics a real chance. (Maryam Mirzakhami) 


They never tell us how addictive mathematics can be. That’s wrong. They really 
ought. (Overheard from a graduate student at the 2014 International Conference 
on Fibonacci Numbers) 


Abstract This is the first of three chapters devoted to the question of how 
to generalize the Lucas sequences. We begin with a detailed discussion of 
Lucas’s thoughts concerning this problem, one which he regarded as being of 
very great importance. This is followed by some historical commentary. As 
Lucas regarded linear recurrences as being key to providing an answer to this 
question, we first discuss the possible application of cubic linear recurrences 
to it. We then move on to several proposals for solutions, including Bell’s 
ideas and Williams’s suggestions and concluding with Roettger’s remarkable 
sequences. 


6.1 Lucas’ Attempts to Generalize His Sequences 


In several of his works on the (U,) and (V,,) sequences, Lucas mentioned 
the problem of extending or generalizing them. In particular, he provided a 
lengthy discussion of this question in [15]. In what follows, we will present his 
arguments in [15] and some further remarks of Lucas, which are relevant to 
this topic. It should be pointed out that Lucas frequently repeated himself, 
so we will provide only one of any repeated statement. For more information 
concerning this subject, see Chap. 1 of Roettger [23]. 
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In Sect. I of his memoir [15], he states: 


This memoir has as its objective the study of symmetric functions of the 
roots of an equation of the second degree, and its application to the theory 
of prime numbers. We will first show the complete analogy of these sym- 
metric functions with the circular and hyperbolic functions. We then show 
the connection that exists between the symmetric functions and the theory 
of determinants, combinations, continued fractions, divisibility, divisors of 
quadratic forms, continued radicands, division of the circumference of the 
circle, indeterminate analysis of the second degree, quadratic residues, de- 
composition of large numbers into their prime factors, etc. This method is 
the point of departure of a more complete study, of the properties of the 
symmetric functions of the roots of an algebraic equation of any degree with 
rational coefficients, in their relation to the theories of elliptic and Abelian 
functions, of power residues, and indeterminate analysis of higher degrees. 


In commenting later concerning [15], he added (see Lucas [16]): 


Since the publication of this work, the author has added to it twenty other 
sections, as yet unpublished, which altogether form the arithmetical theory of 
the symmetric functions of the roots of equations of the second degree. The 
author hopes to find the time to write up in a similar manner the theory of 
doubly periodic functions, in their connection to symmetric functions of the 
roots of equations of the third and fourth degree, and with elliptic functions. 


Lucas appeared to abandon this subject until 1891 when he circulated, 
prior to the Marseille meeting (Sept. 17-24) of the French Association for the 
Advancement of Science (AFAS), his most extensive commentary (Lucas [18]) 
concerning this problem. As this communication provides much information 
concerning his thoughts on this matter, we quote the relevant part of it in 
full: 


But we think we should stress in a particular way research concerning 
linear recurring sequences [Lucas often uses the terms ‘recurrence sequences’ 
and ‘linear recurrence sequences’ as synonyms. (See, for example, Chap. XVII 
of [17].)] of various orders and its connection to the theory of elliptic and 
abelian functions. In several memoirs published in the Comptes rendus de 
l’Association, from the meetings in Clermont, Nancy, Paris and le Havre, in 
the Actes de l’Académie royale des Sciences de Turin and of Saint Petersburg, 
in the Nouvelle Correspondance mathématique, in the Journal de Sylvester in 
Baltimore [American Journal of Mathematics], etc., we have demonstrated 
the analogy and, as it were, the similarity of the circular and hyperbolic 
functions with the numerical functions of the second order, whose charac- 
teristic polynomials are of degree two. (See Chaps. XVII and XVIII in our 
book [17]). To every trigonometric formula corresponds a formula for these 
functions, and conversely. 

We had hoped to find in this study, through the prime decomposition of the 
expressions (a” + b"), a demonstration of Fermat’s last theorem concerning 
the impossibility of solving in integers the indeterminate equation 
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P+ yP + 2? =0 


in which it suffices to assume that p denotes a prime. Although Kummer 
treated this equation masterfully a long time ago, it has still not been com- 
pletely solved, as many of the exponents p cannot be dealt with by his ad- 
mirable analysis. 

But if this research plan has not up to now provided the solution of this cel- 
ebrated problem, it does allow us to obtain a number of Wilsonian theorems, 
that is to say the necessary and sufficient conditions that a given p of twenty 
or thirty digits must satisfy to be prime when one knows the decomposition 
in prime factors of one of the numbers p+1. Furthermore, this method guides 
us to the notion of periodicity of the residues for prime or composite mod- 
uli. Therefore, there is every reason to search for formulas analogous to the 
addition and multiplication formulas for the numerical functions which are 
derived from recurrences whose characteristic polynomials are of degree three 
and four. These formulas find their origin in the theory of elliptic functions, 
and we encounter some of them in a beautiful memoir of Moutard (1) 

(1) Poncelet, Applications d’Analyse et de Géométrie, t. ler, addit. III, p. 
542-548. 


[Moutard’s paper appeared as a special section in Poncelet’s book.] 


One rediscovers these recurrence sequences by generalizing the theory of 
linear substitutions, described by Serret in his Cours d’Algébre supérieure 
(4th edition, vol II, pp. 356-412) in a too-particular form. If one considers n 
linear homogeneous forms in n variables provided by the linear substitutions 


Lpp1 =ALp + L1Yp + Vizpt+..., 
Ypt1 = AgLp + loYp + Yazpt..., 
Zp41 = AsLp + UaYp + Vaz%pt..- , 


in which the coefficients A, , v, ..., are constants, the forms %p+1, Yp+1, 
Zp4+1, +++, are expressed as functions of z, y, z,..., and the coefficients of the 
variables x, y, z,..., producing linear sequences having for their characteristic 
equation 
AYU Vy 
Azg fg -U V2 


fie | ei 
A3 3 V3—-U... 


in which u denotes the variable. The ratios of consecutive functions, or of 
coefficients corresponding to two consecutive functions, have for their lim- 
its under certain conditions of convergence, the root of largest modulus of 
the equation U = 0. One can therefore generalize in an infinitude of ways 
Bernoulli’s approximation technique for calculating the roots of equations. 
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This method is developed in the first volume of Legendre’s Théorie des nom- 
bres, but only for a very particular case. 


In Addition X Sur l’extraction des racines pour les moyennes (p. 506) of 
our book, we have pointed out a new process for obtaining roots of any index. 


This process also relates to the preceding theories and to linear substitu- 
tions. We think that, by developing these new methods, by searching for the 
addition and multiplication formulas of the numerical functions which origi- 
nate from recurrence sequences of the third and of the fourth degree, and by 
studying in a general way the laws of the residues of these functions for prime 
moduli, according to how they look, their cubic or biquadratic character in 
relation to the discriminant of the equation U = 0, that we would arrive at 
important new properties of prime numbers. And perhaps the complete proof 
of Fermat’s last theorem is just a consequence of the famous theorem of Ja- 
cobi concerning the impossibility of more than two periods for holomorphic 
functions of a single variable. 


From this material, it appears that Lucas’s investigations into his se- 
quences were originally motivated by his earlier interest in Diophantine prob- 
lems, particularly Fermat’s last theorem. (See Lucas [14] and Chap. 1 for more 
on this topic.) In connection with this, we mention that Ballot [2] showed that 
the sequences (Un), (Vn), where 


dU, = (c— bya” + (a—c)b” + (b—a)c” and V,=a"+b"4c" 


with 6 = (a — b)(b— c)(c — a) provide a weak, but natural, generalization of 
the Lucas sequences ((a” — b”)/(a — b)) and (a” + 6”). Here a, b, c denote 
distinct, nonzero integers. He proved weak Lucas-like laws of appearance for 
certain special primes in an infinite set S and showed that if a certain subset 
T of S is infinite, then Fermat’s last theorem would follow. 


With regard to linear recurring sequences, Lucas wrote the following in 
the Introduction to his book [17]: 

The theory of recurrent sequences is an inexhaustible mine which contains 
all the properties of numbers; by calculating the successive terms of such 
sequences, decomposing them into their prime factors and seeking out by 
experimentation the laws of appearance and reproduction of the prime num- 
bers, one can advance in a systematic manner the study of the properties of 
numbers and their application to all branches of mathematics. 


After Lucas’ untimely death, there seemed to be little notice of the prob- 
lem of generalizing his functions. His old friend, C.-A. Laisant tried to kin- 
dle some interest through a question in the journal, L’Intermédiaire des 
mathématiciens [12]: 

We know how much the famous theorem of Fermat concerning the im- 
possibility of the identity «” + y” = z”, in integers, has so preoccupied 
mathematicians. We can no longer ignore that the greater part of the work 
of Ed. Lucas on the theory of numbers had for its object, direct or indirect, 
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the quest for a demonstration of this theorem. In particular, he published a 
very interesting memoir Sur la théorie des fonctions numériques simplement 
périodiques. These functions U;, and V,, arise from the equation of the second 
degree and present striking analogies with the sine and cosine functions. 


In seeking to generalize these ideas, Lucas later sent a Communication to 
the Société mathématique de France, which unfortunately was not inserted, 
no Note having been put in by the author, on three numerical functions, aris- 
ing from the equation of the third degree, offering very great analogies with 
the elliptic functions, and exhibiting the property of being doubly periodic. 


Shortly after, during a conversation that I had with him, Lucas said to 
me: ‘if one could establish that my doubly periodic functions only have two 
distinct periods, the theorem of Fermat would be proved.’ And during a meet- 
ing, assuming this hypothesis, he justified his statement by a demonstration 
which was easy for me to follow, but of which I have totally lost any memory, 
being far from suspecting his approaching death. I recall only that the case 
of the exponent 2 was isolated, just as it should be, in a very precise manner. 


It is possible that Lucas had made similar Communications to other col- 
leagues, more favoured from the point of view of memory and more attentive 
than I had been. In this case, I make an appeal to them through the present 
question to consult their memories. It may also be possible that the members 
of the Société mathématique might be able to recover the three numerical 
functions of which I have spoken, and concerning which I have not found any 
indication in the papers left by the author. This would assuredly be a very 
interesting gap to fill. 


Laisant amplified these remarks in response to a query from D. E. Smith 
in 1913 sent at the behest of E. T. Bell (Bell [5}): 


The mathematical papers of Lucas, after his death, were entrusted to a 
commission of three members: M. Delannoy, Lemoine and myself. We found 
in them the necessary elements for the publication of the last two volumes (iii 
and iv) of the Récréations Mathématiques, and of the volume L’Arithmétique 
Amusante. 


The remainder consisted of scattered notes which, in our estimation, were 
not available for publication... I found no trace of the subject about which 
you particularly enquire, and I regret it keenly. I had studied with great 
interest the memoir Sur les fonctions numériques simplement périodiques, 
and I often chatted over it with the author. These functions U, V, derived 
from the equation of the second degree present curious analogies with the 
sine and cosine. Lucas has also considered three functions derived from the 
equation of the third degree, on which he once made a communication to the 
Mathematical Society of France. I can find no trace of this communication, 
and I have lost all memory of it. I recall only the definition of one of these 
functions; it was a” + 6” +c”, a, b, c, being the roots of the equation. From 
the point of view of periodicity these functions exhibited the closest analogies 
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with sn, cn, dn of elliptic functions. They presented certain characters of 
double periodicity. 


On examining the above material, we note several aspects of Lucas’ ex- 
ploration of his sequences and his analysis of those that he considered might 
be suitable generalizations of them. We certainly see that he was interested 
in functions satisfying linear recurrence sequences; these functions should be 
symmetric functions of the roots of an associated polynomial with rational 
(in practice, usually integral) coefficients; further, it seems that he was con- 
sidering more than one function in this study. He also appears to have been 
particularly interested in functions linked to polynomials of degree three or 
four. Admittedly, it is difficult to determine in these cases when Lucas was 
referring to sequences satisfying a third- or fourth-order linear recurrence or 
to sequences whose terms are symmetric functions of the roots of a third- 
or fourth-degree polynomial; he mentions both. He emphasized the need to 
find addition and multiplication formulas involving these functions, as this 
was what he did in order to prove the many properties of his own functions 
of order 2. His suggested method of approach was to use empirical methods 
to attempt to elucidate what the laws of appearance and repetition for these 
functions would be, and from this material he should be able, as he did in the 
case of (U,,) and (V,,), to derive further properties of primes, such as primal- 
ity testing algorithms. However, despite this possible program of research, he 
seems never to have produced any results that were analogous to his work on 
the Lucas functions. 


In his memoir, he mentioned three different possible directions of approach; 
however, it seems that he regarded these as being aspects of a more general 
methodology involving the study of singly and doubly periodic functions. 
The first of these involved symmetric functions of the zeros a, 8, y of a 
cubic polynomial which also satisfy linear recurrences. In particular, Lucas 
considered the power sum S, = a”+6"+7”", for the characteristic polynomial 
x —x—1. This specific sequence (S;,) is now called the Perrin sequence, and 
was the focus of much attention by Adams and Shanks [1]; it seems first 
to have been considered by Catalan [6] in 1861. Catalan, who denoted the 
sequence by (A,,), believed that A,, is or is not divisible by n according to 
whether or not 7 is a prime. This could easily be converted into a primality 
test for N that would execute in polynomial time in log N, but unfortunately, 
Catalan’s assertion is untrue. For example, we see in [1] that 271441(= 5217) 
divides Ao71441; furthermore, several other values of composite n such that 
n | A, are also provided in [1]. We should remark here that some of the work 
in [1] was later extended by Szekeres [28]. Also, Grantham[9] has shown that 
there exist infinitely many Perrin pseudoprimes, as conjectured in [1]. 

In Chap. 8 of her thesis, Décaillot [7] raised the interesting possibility that 
Lucas was aware of Catalan’s work before he (Lucas) embarked on his work 
in primality testing and that it might have inspired him in his investigations. 
However, Lucas, who knew Catalan and who is usually most punctilious in 
assigning priority, nowhere mentions Catalan’s work. Also, Lucas’ result (see 
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Sect. XXI of [15]) concerning this matter (if n | A, and n{ A, for all 7 such 
that 0 < j <n, then n is a prime) is more carefully stated than Catalan’s, 
even though Lucas’ proof of this result is incomplete. Perhaps Lucas did not 
want to embarrass Catalan by mentioning his less circumspect work or, more 
likely, he was simply unaware of [6]. 


Lucas’ second suggestion was a study of the function: 
Uz, = Alo", 2",.9", -++)/A(a, 8,7, m+), (6.1) 


where a, 3,7,... denote the roots of an associated polynomial and A(a, 3,7, 
---), denotes the discriminant of that polynomial. He claimed that such a 
study would result in the generalization of the principal formulas contained 
in the first part of his memoir, but did not follow up on this idea. Lucas’s 
third idea was motivated by the connection of the Lucas sequences to the 
singly periodic circular functions and by their periodic properties modulo m 
(see Sect. 4.1). It seems that from what little he did write on this matter that 
he believed an attempt to generalize his sequences could begin through an 
investigation of the properties of doubly periodic functions, such as the elliptic 
functions, and their connection with recurrent sequences. We will discuss his 
third suggestion in some detail in Sects. 8.1 and 8.2. 


6.2 Cubic Linear Recurrences 


As it appears that Lucas considered linear recurring sequences of order three, 
or cubic recurrences, to be key to the problem of extending of the Lucas 
sequences, we will discuss such sequences in this section. We have already 
noted his interest in the cubic recurrence sequence (A,,). We now examine 
the recurrence sequence (X,,), where P, Q, R are given integers and 


Xnt3 = PXn42-QXni1—-RXn, forn>0. (6.2) 
Let a, 8, y be the three roots of the characteristic polynomial: 
f(x) =2° — Px? +Qa— R. (6.3) 


Since (6.2) is a homogeneous linear difference equation of degree three with 
constant coefficients, we know that if a, 6, y are distinct, then 


Xn = ca” + 2B" + 3", (6.4) 


for certain constants Cc, C2, c3. These constants can be determined from initial 
conditions on (X,,) such as the values of Xg, X,, and X92. In the case where 
only two of the roots of f(x) are distinct, say a = 8 and y # a, then we 
replace (6.4) by 
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Xp = (ant co)a” +03"; 
when a = 6 = 7, we replace (6.4) by 


Xn = (eyn? + con + cg)a”™. 


For an elementary introduction to the theory of difference equations, see, 
for example, the book of Goldberg [8]. All of this material was known to 
Lucas (see Chap. XVII of [17]), but it appears he was unable to find a par- 
ticular set of third-order recurring sequences which was satisfactory. One of 
the main features of the (second-order) Lucas sequences is that U is a divis- 
ibility sequence (see Sect. 2.3), and much of the rich arithmetic theory of his 
sequences is dependent on this; however, it seems that the only third-order 
recurring sequences (X,,) which are also divisibility sequences are given by 
X, = n?a"~1, or X,, = U?, where a is an integer and U,, is either a Lucas 
or Lehmer function. The arithmetic theory of these functions is essentially 
the same as that of the Lucas/Lehmer functions, and therefore they would 
not have served as an interesting or useful generalization of U. In [10] Hall 
showed that if (X,,) is a divisibility sequence and Xo # 0, then the totality 
of the primes that can divide all terms of (X,,) is finite; indeed, these primes 
must divide the gcd(Xo, R). Thus, such a sequence is of limited interest. The 
following result was also proved by Hall in [10]. 


Theorem 105 (Hall [10]) There is no divisibility sequence (X,) with Xo = 
0 whose characteristic polynomial is an irreducible (over the rationals) cubic 
whose last two coefficients are relatively prime. 


Later Ward [30] eliminated the condition that the last two coefficients of 
f(x) be coprime. In a footnote on page 580 of [10], Hall states that he learned 
from Ward that he (Ward) had shown that there could be no (interesting) 
divisibility sequence whose characteristic polynomial has a linear and an irre- 
ducible quadratic factor. However, it appears that either Hall or Ward might 
have been mistaken in this assertion because Ward makes no reference to this 
result in any of his papers, particularly where this result would have been 
very relevant to his theme. See, for example, p. 88 of [30]. 


It seems, then, that it is unlikely that there exist any interesting divisibility 
sequences of order three, but it must be stressed here that so far this has not 
been proved. Certainly no such sequence has been found up to now. Given 
Lucas’ use of empirical methods, he might have discovered that using third- 
order linear sequences would not provide a fruitful avenue of discovery, but 
see the remarks at the end of Sect. 7.2. Nevertheless, Ward [30] was able 
to develop laws of repetition and appearance for primes in a general cubic 
recurrence; however, it is questionable as to whether these were the kind of 
results that Lucas was seeking, as he seems to have been concerned with 
particular sets of sequences satisfying (6.2). 
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6.3 Bell’s Contribution 


On reading Laisant’s letter, E.T. Bell [5] developed an interest in the problem 
of extending the Lucas sequences to third-order recurrences. In the Lucas 
sequence case, we observe that there must exist integer sequences (X,,) and 
(Y,) for n > 0 such that 

a” =Y,at+ Xn, (6.5) 
where Xp = 1, Yo = 0, X, = 0, Y, = 1. Since £ is also a root of x7— Px+Q, we 
see that (6.5) must hold when we substitute 6 for a. From the resulting two 
equations, it is easy to deduce that V,,(P,Q) = 2X, + PY, and U,,(P,Q) = 
Y,. Notice that we have the formulas: 


am = Vina + Xmin = Vna+Xm)(Ynu + Xn) (6.6) 
and 
a” = Vinna@ + Xmn = (Ynat Xn)”, (6.7) 


which can be used to find the addition and multiplication formulas for (X,,) 
and (Y;). 

This appears to provide an avenue of approach to the problem of extending 
the Lucas sequences. We now let a, 3, 7 be the three roots of (6.3) and define, 
as did Bell, the integer sequences (an), (Yn), (Zn) for n > 0 


a” = gn07 + yn + Ep. (6.8) 


with similar expressions for 6 and y. In this case we have xp = 1, yo = 0, 
Zo = 0; 41 = 0, yy = 1, 21 = 0; ro = 0, yo = 0, 22 = 1. Also, each of the three 
sequences (2p), (Yn), (Zn) satisfies the recurrence (6.2), and each of xy, Yn, 
Zn can be written as a symmetric function of a, 8, y. Unfortunately neither is 
Sy, =a"+6h"+7", but we can write S,, as a simple linear combination of xy, 
Yn, and Zn: Sy = (P?-2Q)2n+PYyn+3xn; indeed, we can write any particular 
integral sequence satisfying (6.2) as a linear combination (with fixed integral 
coefficients) of %, Yn, and zp. It is easy to see that the sequences (V,,) and 
(U,,) defined in Ballot [2] are given by V,, = S, and U;, = Zn. 


Further properties of Bell’s sequences have been derived by Ward [29], 
Mendelsohn [19], and Miiller [20]. In the latter paper, it is shown how they 
can be used to produce a fast algorithm for solving the cubic congruence 
x? = m (mod p), where p is a prime such that 9 | p — 1 and the integer m 
is a cubic residue of p. (See Sect.3.4.) We should also mention that Lucas 
himself discusses these sequences in more general context in Sect. 173 of [17]. 


For suppose we put 


f(z) =a" — Pio?) + Por*? —.-. 4+ (-1)*P,, (6.9) 
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where P,, Po,..., Py are fixed given integers. If p; is any of the k roots of 
(6.9), then by complete analogy to (6.8), we can write 


2 


pi = Ae ia + Ce ai ope eae Zon: 


Here we see that Z;,; = 6;,; (0 < i,j < k), where 6; = 0 when i # j and 
6:4 = 1; also, Z;,, = (—1)*~**!Py_;. Furthermore, by analogy to (6.2), we 
get: 

Zantke = PZinsn—1 — PeZimen—ats+> + (-1)" 1 Pe Zin: 


Notice that if f(z) = x? — Px + Q, we get Xn = Zon, Yn = Zin; if f(x) is 
given by (6.3), then tp = Zon; Yn = Zin and Zn = Zon. 

In Sect. 173 of [17], using a different notation, Lucas simply mentions the 
k sequences (Z;,,,) fori = 0,1,2,...,4—1 without much additional comment. 
If these were the sequences that he was thinking about as a possible solution 
to the problem of extending the Lucas sequences, it seems odd that he did 
not say anything further. 


We see that we can use results like (6.6) and (6.7) to find addition and 
multiplication formulas for x,, yy, and z,; indeed, we have: 


Zinn + Yn + San = (no? + nat en)™. (6.10) 


In commenting on a formula which is essentially (6.10), Bell concluded [4] 
with the following comment: 

We reach here an essential difficulty of the whole subject [order three recur- 
rence versions of Lucas’ sequences], that is, the determination in a practicable 
form [Bell’s italics] of zn, Ymn; mn from the identity of definition [(6.10)]. 
The corresponding problem for Lucas’s functions presents no difficulty and 
much of the arithmetic fertility of his theory springs directly from the ease 
with which the complete solution is found in his case and the consequent 
simplicity of the resulting functions. This simplicity vanishes, apparently ir- 
revocably, when we pass beyond second order series. 

In retrospect, this verdict seems unduly pessimistic, but it explains Bell’s 
subsequent abandonment of the recurring sequence method of approach to 
Lucas’ problem of generalization. 


6.4 The Cubic Pell Equation 


Let ¢, denote a primitive nth root of unity (see Sect. 4.3). For example, when 
n = 2, G, = —1 and when n = 3, ¢, and ¢? (¢7') are the two roots of the 
quadratic equation: «? + «+1 =0. We remark that the Pell equation (3.14) 
can be written as 


a? — dy? = (a + Vdy)(a + CoV dy) = 1. 
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Set 6 = Wd. The cubic version of the Pell equation can now be written as 


a? +dy?+d?23—3dayz = (x+dy+6*z)(x+woytwdz)(atwdy+wd?z) = 1, 

(6.11) 
where w = ¢3. For an elementary introduction to this Diophantine equation, 
see Chap. 7 of [4]; for a more advanced treatment and several references, see 
Chap.5 of Hambleton and Williams [11]. The behavior of the solutions of 
(6.11) is very similar to that of the solutions of the Pell equation, in that if 
d is not a perfect integral cube, there always exists a fundamental solution 
(X1,Y1, 21) of (6.11) from which all other solutions can be derived. The 
determination of the fundamental solution is a rather intricate process for 
most values of d, but Rudman [26], completing earlier work of Nagell and 
Stender (see [26] for references), showed that whenever d has the parametric 
form d = M* +k, where k | 3M?, then it is a simple matter to find the 
fundamental unit 7 (< 1) of the pure cubic field Q(6). In almost all cases 
n = (6— M)3/k. If we we apply his results in the particular case of k | 3M, we 
usually (|k| 4 1) have the fundamental unit of the ring Z[6] as 7 = (65-M)?/k 
or 


k?n-' =9M® + 9M?k + k? + (9M° + 6M7k)d + (9M* + 3Mk) 6. 


For example, if d = M? +, where e« = +1, then, Nagell showed that we 
can put X, = M?, Y, = M, Z, = 1, but if d = M? + 3e, then we put 
X, = M6 + 3eM? +1, ¥) = M° + 2eM?, Z) = M*+eM. 

If we put a = X, + 6Y, 4+ 6?Z,, 8B = X, + wd5Y, + wd?Z, and y = 
X, + wdY, + w26?Z,, then any integer solution of (6.11) can be written as 
(Xn, Yn; Zn) for some integer n, where 


Xyt6Y,4+6Z, =a", Xnt+wdY,+w6"Z, = 8”, XntwdY,+w6"Z, =". 
From these equations, we can easily deduce that 
3X =a" + BP +", 
36Y, = a” + wh” + wy”, (6.12) 
35°Z, = a +w7p” + wy”. 
Notice that a, 8, y are the roots of the cubic polynomial 2° —3X 12? +3(X? 


dY,Z,)x—1; hence, the sequences (Xp), (Yn), (Zn) all satisfy the third-order 
linear recurrence (6.2) with P = 3X1, Q = 3(X? —dY,Z,), and R=1. 


6.5 Williams’s Sequences 


On referring back to (2.4), we can produce an alternative definition of the 
Lucas sequences (U,,) and (V,,) by first putting, for given integers P and D, 
a =(P+56)/2, 8 =(P—5)/2, where 6? = D. We then define: 
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Vn =a" + 8”. 


We then have U; = 1, Vi = P. In [31], Williams made use of this definition 
and (6.12) in order to produce a cubic extension of the Lucas functions. For 
integers P, Q, R (4 0), D, let a, 6, 7 be the roots of the polynomial (6.3) 
and 63 = D. We define: 


W, = 0" +B"+7", Vn = (a +wB"+w?y")/6, Un = (a +w?B" tury") /6", 


subject to the constraint that W,, Vn, Un all be integers for all n > 0. Note 
that we now are using the notation (U,,) and (V,,) to refer to these new 
sequences, not the standard Lucas sequences. Naturally, this constraint leads 
to a priori conditions that have to be satisfied by P, Q, R, and D, and these 
are provided in Chap. 4 of [31]. It is important to observe that, unlike the 
case of the Lucas sequences, the sequences (W,,), (Vn), (Un) (n > 0) cannot 
all be integral for any choice of the integers P, Q, R. This is because the 
functions W,, Vn, Un are not symmetric functions of a, 8, y. For the P, Q, 
R selected as above, it is easy to show that W,, V,, Un are related to the 
corresponding Bell sequences by W,, = 27/3, Vn = Yn/3, Un = 2n/3. 


For given complex numbers a, b, put ; = wa — w2b and ©) = w?a — wb. 


We define Ry = (w?O? — w6?)/(w? — w), Sp = (WOT — w?F)/(wW? — w), 
Ty = (@? — &%)/(w? — w). In Chap.3 of [31], it is shown that if we put 
Lm = Wm + WVm + w?Um, a= w?Um — Wm, b= Wm — wVm, then 


Wie a TAR Ss. SOV a eR. Sg Se ST. 


Many identity properties of Ry, S,, Tn are presented in [31], and in [34] 
further arithmetic properties are derived for the case in which a — b and 
a? + ab + b? are both integers. These results were extended and generalized 
by Ballot in [3]. 

Just as in the case of the Lucas sequences, the sequences (W,,), (Vn), 
(U,,) satisfy a number of algebraic identities, and it is possible to derive 
subtraction, addition, and multiplication formulas for each of them. However, 
since each of these sequences satisfies the third-order linear recurrence (6.2), 
we would not expect any of them to be a divisibility sequence. Nevertheless, 
this difficulty can be overcome by simply considering D, = gcd(Vn,Un). 
While the sequence (D,,) is not a linear recurrence sequence, it is a divisibility 
sequence, and many arithmetic properties of these extended Lucas sequences 
can be derived by making use of this observation and subsequent results 
concerning V,, and U,, modulo a prime. In this way, we can derive a law of 
appearance, a law of repetition, and a version of Lucas’ fundamental theorem. 
Also, it is possible to employ these sequences to produce some primality tests. 
Indeed, all of Lucas’ arithmetic results about his sequences have analogues 
in the realm of (W,,), (Vn), (Un). 
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Also, Pocklington [22] made use of a special case (D = m, W, = 3u,, 
VY, = 301, U1 = 3ui1, where R = w} + mv? + m?u3 — 3wiviu1 is not a 
cubic residue of p) of these sequences to solve the cubic congruence x? 
(mod p), where p is a prime congruent to 1 modulo 3 and m is a cubic residue 
of p. The technique can be readily deduced from the triplication formulas for 
W,, Vn, Un and the result that p | D,-; whenever p = 1 (mod 3) and D 
is a cubic residue of p. Pocklington’s idea was extended in Williams [32] to 
the solution of the general binomial congruence x? = m (mod p), where p 
and q are primes, qg is odd and q | p— 1. This required sequences which were 
generalizations of the sequences (W,,), (Vi), (Un), and the method represents 
an extension of the Cipolla technique discussed in Sect. 3.4. It was shown in 
Williams [33] that such sequences also share many of the same identity and 
arithmetic properties as the original Lucas sequences. The method of [32] 
was later improved by Williams and Hardy [35]. We also mention that Said 
and Loxton [27] have employed the Williams sequences to produce a cubic 
analogue, similar in concept to LUC, of the RSA cryptosystem. 


=m 


While it may appear, then, that the sequences described above might have 
been what Lucas was seeking, it is unlikely that this extension of his functions 
would have been acceptable to him. While these sequences do satisfy a linear 
recurrence of order three, they are not symmetric functions of the zeros of 
the characteristic polynomial f(a), and therefore they are not always integers 
unless the coefficients of f(a) possess certain special properties. 


6.6 Roettger’s Sequences 


One of the earliest attempts to extend the Lucas functions was done in 1880 
by de Longchamps [13]. If we put R = a6, where a, 3, 7 are the zeros of a 
cubic polynomial f(x), Longchamps regarded D,,, E,, and S,, where 


RD, = (0 + B")(B" +9")(9" +0"), 
R Ey, = (a” — B")(B" — y")(y" — a) /(a— B)(B- yy — &), 
S,=art+ B+", 


as the degree three functions analogous to the Lucas functions. Are these 
the three functions that Laisant mentioned? They are symmetric functions 
of a, 8, y, and they would have been known to Lucas because he was the 
Session Chair for the talk in which Longchamps presented his results. Also, 
as we mentioned at the end of Sect.6.1, Lucas had already mentioned a 
general version of the function A(a”, 8",y", ---)/A(a, 8,7, +--+) which, in 
the cubic case, Longchamps denoted by R” E,,. In fact Longchamps showed 
how to express D, and E,, in terms of the coefficients of f(a). However, 
Lucas would not likely have been keen on the fact that the first two of these 
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functions would not necessarily be integral, but this is a minor difficulty which 
can be easily overcome. 


In [23] (see, alternatively, Miiller et al. [21]) Roettger began with a cubic 
polynomial f(x) = x? — Px? + Qx — R, where P, Q, R are integers and put 


6=(a—8)(8—y)(y-—a), A= = P°Q? —4Q° —4RP? + 18PQR— 27K’, 


where a, 3, y are the zeros of f(x). Note that here A denotes the discrim- 
inant of f(a), a cubic polynomial. We will assume here that 6 4 0. He 
next defined functions that we will denote here by U, or U,(P,Q,R) and 
W,, or W,,(P, Q, R)—in this section we will confine the notation U,,, W,, to 
Roettger’s functions only—as 


6Un = (a" — B")(B" — 7") (y" — a”) 
= (ae + Bryn +7%07") _ fae" A Bae +77"q"), 
Wr = (a B2" + Bryn +7"0?") 4 (a?" p” Sa +77"q"), 


We have Uo 0, U, 1, U2 PQ-R; Wo = 6, W, = PQ —- 3R, Wz = 
P2Q? — 2Q3 — 2RP? + 4PQR — 3R?. Also, U_, = —R72"U,, and W_y, = 
R-?"W,,, results analogous to (2.7). 

Note that U,, is the same as Lucas’ A(a”, B",y", ---)/Ala, 8,7, °°: (= 
R°E,,) and W,, = Vn — 2R”, where 


Vy = Dy =O + A B® Sa He): 


Both U, and W,, are symmetric functions of a, 8, y and are therefore in- 
tegers for all nonnegative values of n. It is these functions that we will use 
as our extensions of the Lucas functions U,, and V,,. Observe that (U,,) is a 
divisibility sequence. However, both the sequences (U,,) and (W,,) satisfy the 
6th-order recurrence: 


Xn46 = ArXn45— AaXn44 + AgXn43 — AsXnpo2+AsXn41—-AbXn, (6.13) 


where A; = PQ — 3R, Ap = RP? + Q? —5PQR+6R?, A3 = R(P?2Q? -— 
2Q? — 2RP? + 6PQR—7R?), Aq = R? Ao, As = R*A,, Ag = R°. 

It seems odd that Lucas does not seem to have examined these sequences 
except, perhaps, in a very cursory manner. Possibly, considering his apparent 
fixation with third- and fourth-order recurrences, he was put off by (6.13). If 
so, he made a mistake, because the theory of the sequences (U;,) and (W,,) 
exactly parallels that of the Lucas sequences, even down to the Euler criterion 
(Theorem 36). There are addition formulas, multiplication formulas, and a 
number of identity relations. For example, 


Uon = Un(Wn+2R"), 2Wen = AU? + W? —4R"W,, 
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are the analogues of the Lucas duplication formulas (2.23) and (2.24) and 
AU3n = Un(AU2+3W2), 4W3n = W2(W,—6R”)+3AU2(W,,+2R")+24R°" 


are the triplication formulas. 


Furthermore, there is a rich arithmetic theory, including laws of repetition 
and appearance, which leads to new primality testing criteria. For example, 
if L, = 3?"+! — 3"+! +1 is known to be a prime, then a test can be derived 
for the primality of N,, = 3"*!—2. While rare, such prime pairs (Ln, N;,) do 
exist. These sequences can also be used to produce (Roettger and Williams 
[25]) sufficient and necessary tests for the primality of numbers N when 
N? 4+ N +1 is divisible by a large power of a prime. 


Much of this is discussed in considerable detail in [23], to which the in- 
terested reader is directed. What is particularly remarkable in this system 
is that, just as in the case of the Lucas sequences, only the two sequences 
(U,,) and (W,,) are required for its development. In view of (6.13) and results 
in the next chapter, this seems to be most remarkable, as one would expect 
that more sequences might be needed. (According to Laisant, even Lucas ex- 
pected to need three sequences in the cubic case.) In addition, it is argued in 
[23] that Lucas possessed the mathematical knowledge and ability needed to 
discover these results. 


We conclude this section by pointing out that it is even possible to establish 
a public-key cryptosystem (Roettger and Williams [24]) through the use of 
these sequences. 
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Chapter 7 


Linear Recurrence Sequences and 
Further Generalizations 


Cerveau : Vinstrument, bien str, se détériore, mais on en joue plus habilement. 
(Jean Rostand) 


Le mieux est V’ennemi du bien. (French proverb) 


Abstract In this chapter we examine the problem of generalizing Lucas’ 
sequences from the perspective of linear recurrences. We first derive a number 
of features of such recurrences, culminating in a generalization of Lucas’ law of 
appearance. We follow this with an investigation of the properties of impulse 
sequences, particular cases of the more general linear recurrence. It turns out 
that the impulse sequences have many characteristics in common with the 
Lucas sequences. We next turn to the problem of generalizing the Lehmer 
sequences and how they may be used to produce necessary and sufficient 
tests of primality for numbers N of the form N = Ap” +7, where p is an odd 
prime, 2| A, p{ A, y € {1,-1} and A < p”. 


7.1 Linear Recurrence Sequences 


In this section we will consider some of the properties of linear recurrence 
sequences with constant integer coefficients. As we have seen in Sect. 6.1, these 
seemed to be the sequences that Lucas considered as essential to solving the 
problem of generalizing his sequences. We begin by generalizing some of the 
results in Sect. 4.1. Let k be a fixed positive integer and let the sequence (X,,) 
be determined by an initial vector of k integer values: 


To = (Xo, X1, X2,.--, Xk-1) (7.1) 
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and the kth-order linear recurrence 
k-1 
Xnte = PiXnte—-1— PoXnpr—2+++++(-1)" PeXn, (7.2) 


where P|, P2,...,P, are given fixed integers. Notice that the values for X;, 
for all n > 0 are completely determined by (7.1) and (7.2). We define the 
characteristic polynomial f(a) of the linear recurring sequence (X,,) to be 


f(x) =a" — Pyo*! + Pog*-* —...+(-1)* Py. (7.3) 


We define the order of (X,,) to be the least positive value of k& such that 
(X,) satisfies a relation of the type (7.2). If (X,) is a linear recurring se- 
quence, we say, following Ward [13], that an integer m (> 1) is a null divisor 
of (X,,) if for some minimal h (> 0), we have m | X,, for all n > h. If 
(X;,) has no null divisors, it is said to be a non-null sequence. For example, 
a regular Lucas sequence U is always non-null. Observe that necessary con- 
ditions for (X,,) to be non-null are that gcd(Xo, X1, X2,...,Xz-1) = 1 and 
gcd(P;, P2,..., P.) = 1, but these conditions are not in general sufficient. 

For i > 0, let T; denote the k-entry row vector (X;, Xj41, Xi+2,°°+ , Xi+g—1) 
and observe by (7.2) that T;,1 = T;C, where C is the k x k companion matrix 
of f(a). From this it follows by induction that T,, = Tp>C”. We now assume 
that m is a fixed positive integer. Since there are only a finite number of 
residues of the integers modulo m, there must exist some r (> 0) and s (> 0) 
such that C’T® = C* (mod m); hence, T,,, = T; (mod m). Notice that the 
values of r and s here are independent of the initial conditions To. Since for 
any i > 0, T;41 (mod m) depends only on T; (mod m), we must have 


T; =Tj1r (modm)  foralli>s. 
That is, there must exist some tT (= T(m)) such that 7 > 1 and 
Xnir = Xn (mod m) (7.4) 


for all sufficiently large integers n. Notice that if (7.4) holds for all sufficiently 
large values of n, then X,_, = X, (mod m) must also hold for all sufficiently 
large values of n. As the determinant of Cis P,, we see that if gced(m, P;,) = 1, 
we have C” =I (mod m), where I is the k x k identity matrix. Thus, in the 
particular case when gcd(m, P;,) = 1, we see that (7.4) will hold for all n > 0 
and X,_, = X» (mod m) for all n > s. We will now assume that 7 is the 
least positive integer satisfying (7.4); we call r the minimal period length of 
(X,,) modulo m. Observe that if ¢ is any positive integer, then X,-17 = Xn 
(mod m) for all n > s. 
Suppose that for some integer w > 1, we have: 


Xniw =Xn (mod m), 
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for all sufficiently large n > 0. By the same reasoning as that employed in 
Sect. 4.1, we see that we must have 7 | w. 


Theorem 106 Jf im, and mz are coprime positive integers, then T(m m2) = 
lem(t(m1), T(m2)). 


Proof Set « = |lem(r(m1),7(m2)). By the previous observation, we must 
have t(m1) | T(myme2) and T(mz2) | T(m me); hence, w | T(m m2). Also, 


since T(m1) | w and t(m2) | pw, we must have X; = Xi+, (mod mj) 
and X; = Xi+, (mod mg); hence, since m; and me are coprime, we get 
X; = Xitp (mod mmz). It follows that t(mime2) | pu, and therefore 
pe = lem(r(m1), T(me2)). 


We next examine the particular sequence (Z,,) which satisfies (7.2) with initial 
condition vector (0,0,0,...,0,1). This sequence is the same as (Z,—1,n,) in 
Sect. 6.3. For this specific sequence, we will denote the minimal period length 
of (Z,) modulo m by z(m). That is, for all sufficiently large n, we have 


Zntia = Zn (mod m). (7.5) 


Put S; = (Zi, Zit, Zi42, sang Zitk-1) and consider the k x k matrix M whose 
jth row is S;_; for 7 = 1,2,...,k. We see that the rows of M can be per- 
muted to form an upper triangular matrix with diagonal entries all 1. Thus, 
det(M) = +1 and the entries of M~! are all integers. Put B = Ty)M~' and 
let B = (bo, bi, b2,...,b¢-1), where the 6; values are all integers. By defini- 
tion of B, we have T,, = ToC” = BMC”, and since the jth row of MC” is 
Sn+j—-1, we see that MC” is symmetric and 


Xn = boZn + b1Zn41 + b2Zn42 +... + be-1Zn+k-1; (7.6) 


for all integral values of n > 0. We are now able to show that whatever the 
initial conditions on (X;), we know that v(m) must be a divisor of 1(m). 


Theorem 107 For t(m) and 7(m) defined as above, T(m) divides x(m). 
Proof Put 7 = (m). By (7.6) we have: 

Xitn = boZign + bi Zignsi + beZignga +... + bg-1 Zit th-13 
thus, for all sufficiently large 7, we get from (7.4) that 


Xitn — bo Zi + by Zig + be Zi42 ee tan bp-1Zitk-1 = X; (mod m). 


The theorem now follows easily by our previous remarks. 


We next provide a sequence of theorems concerning m(p*), where p is 
any prime. In the case where k = 2, we note that Z, = U,(P1, P2); thus, 
these results can be considered as generalizations of some of the arithmetic 
properties of U. 
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Theorem 108 For any prime p and a> 1, we have r(p**") | pr(p*). 


Proof Put 7 = 1(p*). For all sufficiently large i, we have p* | Zj4,—Z;. Put 
Y; = (Zit, — Z)/p% and observe that Yj; satisfies (7.2); thus, there is some 
T = 7(p*) such that Y;,, = Y; (mod p*) and 7 | 7 by Theorem 107. Hence, 
p°Y itn, = p*Y; (mod p”*) for sufficiently large i. Since Zj4. = Z; + p*Yi, we 
can show by induction on 7 that for 4 large enough, we have: 


Litjn = Zi + jp°y; (mod p?*). (7.7) 


Since 2a > a+ 1, we see that for j = p, we get Zi+px = Z; (mod p**') and 
the result follows. 


From Theorem 108, we also see that since 7(p*) | 7(p**+), we must have 
m(p*t*) = (p") or m(p***) = pr(p*). 


Theorem 109 For any prime p and integer a > 2, we have r(p*t?) # 
m(p°**) when m(p***) A m(p*). 


Proof If we define (Y;) as in the proof of Theorem 108, we see by (7.7) that 
we cannot have p | Y; for all arbitrarily large i, for otherwise we would have 
m(p*t!) # m(p*). Since 2a > a + 2, on putting j = p in (7.7), we get: 


Zitjn = Zi tp ty; (mod p**?). 


Since p cannot divide Y; for some sufficiently large 7, we cannot have 
m(p*t?) A n(pe*"). 


By repeatedly applying Theorem 109, we see that if a > 2 and (p**!) # 
m(p*), then m(p2+) = p’x(p%). Thus, we have produced a result very similar 
to Lucas’ law of repetition. 

By using the theory of finite fields, we know that if p is a prime, we can 
decompose f(x) over F, as 


f(x) = filw)™ fo(@)? ++ f(a)" (mod p), 


where f(x) is of degree k; and irreducible over F,; furthermore, we may 
assume that the polynomials f;(a) (1 <7 < m) are all distinct. Notice that 


ek, + egke +++» +emkm =k: 


Let e = maxi<i<m{ei}. We have e = 1 when p does not divide the dis- 
criminant of f(z). We also let p; denote the k;th power of p and put 
l= lemi<i<m (pi _ 1). 

The following result corresponds to Lucas’ law of appearance. 


Theorem 110 Let p be prime such that p{ Py(= f(0)), and define the non- 
negative integer k by p’ <e < p**!. Ife =1, we have m(p) | 1 and ife > 1, 
then m(p) | p® tt. 


7.2 Impulse Sequences 187 


Most of the above results were known to Carmichael [4] and others were 
added by Engstrom [6]. Further results on this topic were obtained by Ward 
[12]. We have followed here the presentation in Robinson [11] and an un- 
published note of Curtis Bright [3]. Because of their application to electrical 
engineering, in particular to the construction of linear feedback shift registers, 
the literature concerning linear recurrences is enormous. See, in particular, 
Chap. 8 of [9] for a modern treatment of this material and many references. 


7.2 Impulse Sequences 


On page 2 of Everest et al. [7], the sequences which we have denoted by (Z;,n) 
fori =0,1,...,k—1 are called impulse sequences. In [9] the specific sequence 
(Zp) or (Ze—1,n) is called an impulse response sequence. It is easy to see that 
if (X,,) is any sequence which satisfies (7.2), then 


Xn = XoZon + X1Ziyn +++ + Xp-1Zk-1,n- 


Thus, the impulse sequences form a basis for all linear recurrence sequences 
with characteristic polynomial f(x) given by (7.3). We have already men- 
tioned that the Lucas sequence (U,) is the impulse sequence (Z,) when 
k = 2. Also, using the notation of Bell [2] (see Sect. 6.3), we have tp, = Zon, 
Yn = Zin; 2n = Zon = Zn when k = 3, Py = P, Pp = Q and P; = R in (6.3). 
Indeed, in [5] Carmichael, who denoted Z,, (or Zp-1,n) by Gy, mentions it 
as one of the two possible generalizations of the Lucas sequence (U;,). 


In this section we will briefly discuss some properties of the impulse se- 
quences that correspond to those of the Lucas sequences when k > 2. We will 
assume here that A, the discriminant of f(a), is nonzero. From simple results 
in Sect. 10.1 of [15], we see that, like the Lucas functions, the functions Z;,, 
for i = 0,1,2,...,k—1 are symmetric functions of the roots of (7.3). Also, as 
noted in Sect. 6.3, if p; is any one of the k roots of (7.3), then we can write: 


—2 


Or = Ze-1npy* + Ze_anpe ? +--- + Zon. (7.8) 


We can use (7.8) to develop addition and multiplication formulas for the 
impulse sequences. For example, we have: 


[ial = Agee + Ti Gos a ee Zontm 
= Gein “F VT a ala Zo,n) x 


Zetg ble jap Be ee), 


for each of the distinct & roots of (7.3). From this it is easy to see that the 
addition formulas are 
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k-1k-1 


Zin= > > Bitintnae (ea UMecigkh= 1), (7.9) 
i=0 j=0 


If we put m = 1 and use the values of Z;, (= 6;,1) for 7 =0,1,.. 
get: 


.,k—1, we 


k-1 
Lin = SS ZinZh itl: 
i=0 
Hence, forh = 0 and1<h<k, we have: 


Zon-+1 = (1)? PF Zhint = Zn—1n a ol aay 2; 


(7.10) 
If we put m = n in (7.9), we get the doubling formulas: 
k-1k-1 
Fig =>" Fete tng (he SO Axi h=T), (7.11) 
i=0 j=0 
and if we put m = n+ 1 in (7.9), we get the formulas: 
k-1k-1 
Zronsi=S_ >) ZimZjnsiZnitg (h=0,1,...,8-1). (7.12) 
i=0 j=0 


If we make use of (7.10) in (7.12), we can get formulas for Zp,.2n41 for 
h=0,1,2,...,k—1 in terms of Z;,, (= 0,1,2,...,4—1). This means that 
if we define the k-tuple 7, as {Zo,n,Z1,n,Z2,n,---,Zk—1,n}, then given the 
k-tuple J,, we can compute the two k-tuples 72, and 72n+41. Furthermore, 
by appealing to (7.10) and then (7.11) with n replaced by n + 1, we can 
also compute 72n+42. Each of these three k-tuples can be calculated in O(k?) 
multiplication and addition operations. As a consequence of this, we can 
then use the ideas of Sect. 3.3 to compute, given any positive integer m, the 
k-tuple Tm, in O(k? log m) multiplication and addition operations, given only 
the coefficients of f(a). For other approaches to the problem of computing 
remote terms in a linear recurrence, see the work of Miller and Spencer-Brown 
[10] and of Fiduccia [8]. 


The multiplication formulas can be derived from 


p. = VST le + en ae ee Zonm 
= (Zp—-1ynP; | + Ze-2,nPh 7 +++ + Zon)” 


and the multinomial theorem. If we let C(to,i1,...,%,-1) denote the multi- 
nomial coefficient m!/(to!é1!--+¢~~1!), then 
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k-1 
Dw S. CGe tty estiai ne | [nls (7.13) 
{io,i1,-.-,in-1} j=0 


where the sum is taken over all the sets of nonnegative integers {ig,71,..., 
: k-1. So, ae : k-1. . 
ip—1} such that = i; = mand s = s(ig, t1,..-,%%-1) = j=0 jo tj. 


Furthermore, by observing that p01 --+ px—1 = Pe—1, we have: 


k-1 
PR» = |] (Ze-1nph! + Ze-2nph? +--+ + Zon) (7.14) 
i=0 


We next derive some arithmetic properties of the (Z,,,,) sequences. We 
first observe by the results in the previous section that if we put Z, = 
(Zon; Zin; 22,n)-++Zk—1,n), then for all n > 0, we have: 


Zn+n = Zn (mod m), (7.15) 


where m is any positive integer such that gcd(m,P,) = 1. Put W, = 
gcd(Z1n, Z2n,-+-;Zk—1,n); from (7.13) it is easy to see that (W,,) is a divis- 
ibility sequence. When k = 3, we get from (7.10) that 


Zant = Zin + Pi Zon; 


hence, in this case W, = gcd(Z1n, Zan) = gcd(Zon41, Zan) = ged(Un41, 
U,,), the sequence discussed in Ballot [1]. 

Since Zp = (1,0,0,...,0), we see by (7.15) that m is a divisor of W,. Call 
the least positive value of n such that m divides W,, the rank of appearance 


of m in (W,,) and denote it by w = w(m). We now have a generalization of 
Theorem 8. 


Theorem 111 Suppose gcd(m, P,) = 1. If w is the rank of appearance of m 
in (W;) andm|W,, then w | n. 


Proof Let n = qw +r, where 0 <r <w. By (7.9), we get 


k-1k-1 


0S Fics YY Ze2i2 nig (mod m) (WS O,1y..,6 1), 
i=0 j=0 


Since Ziq. = 0 (mod m) for 7 > 0, we have 0 = Zp, qutr = Se Z0 qu Zj,r 
Zn,j = Zo,qwZn,r (mod m) for h =1,...,k—1. By (7.14) we have (Zo,qu)* = 
PZ”, (mod m); hence, gcd(Zo,qu,m) = 1, and therefore Zp, = 0 (mod m) 
forh = 1,...,k—1. Thus, m | W,.. It follows that by definition of w, we must 
have r =O andw|n. 


Corollary 112 If gcd(m, P,) = 1, then w(m) | m(m). 
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As must now be evident, many other results analogous to those involving 
the Lucas sequences can be derived for the impulse sequences. It is remark- 
able, given Lucas’ interest in linear recurrence sequences, that he did not in- 
vestigate these impulse sequences further. Perhaps he intended to do so, but 
no evidence of this seems to have survived. We will discuss this at greater 
length later in the next chapter, but for now we will turn our attention to 
generalizing the Lehmer sequences. 


7.3 A Return to the Lehmer Sequences 


In order to generalize the Lehmer sequences in Sect. 4.4, it is helpful to define 
the four sequences (Vj,n) and (U;,,,) for i = 0,1. Here we put: 


Von =Vn(R,Q) and Vi, =0; Up, =0 and Yi, =U0,(R,Q), 
if n is even; 
Vix =O endVin=VA(ROy fog =0a( 80) and ig = 0, 
if n is odd. 


Let p1, p2 be the roots of x? — R and let a; + 8; = pj and a; 8; = Q (i = 1,2); 
notice that since p; = —p2, we get a, = —az or 2, = —/3. It is not difficult 
to see, then, that 


a — OF 
ai — Bi 


We can use (7.16) as an indication of how to extend the Lehmer sequences. 
As in Sect. 7.2, we let pi, p2,..., px be the roots of f(x), where f(x) is given 
by (7.3), and we assume that A, the discriminant of f(x), is not zero. For 
a given nonzero integer Q, we define a;, 6; by a; + 8; = p; and a,b; = Q 
(i = 1,2,...,k). By analogy to (7.16), we now define the 2k functions Uj,» 
and Vin (i =0,1,2,...,k—1) by 


_ Von + PU ns a; + Br = Vown + PiVin (i = 1, 2). (7.16) 


a. _— 4 ms 
a Von + piUin + p;U2n Saal ps aes (7.17) 
ai — Bi 
and 
a? + BP = Von + piVin + pe Van +++ + pf Vian: (7.18) 


Since the roots of f(a) are all distinct (A 4 0), we see immediately that 
Uio =9, Vio = 26:0, Via =Oi411, Vir =O. (@=0,1,...,4 — 1); 


furthermore, since the identities (2.35) and (2.36) hold for any pairs (P,Q), 
even when P is not an integer, and 
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af ~ 6 : 
3, = Unlein@), af + BP = VnloirQ), (7.19) 
we see that Uj, and V;,, are all integers for n > 0 andi =0,1,...,k—1. 


If we set h;(x) = x? — pix + Q, then 


F(x) = hi(2)ho(x)...hy(x) = &* f(x + Q/z) 


is a polynomial of degree 2k with integer coefficients whose roots are a;, {; 
(¢=0,1,...,4—1). Indeed, if we put 


F(a) = 2k _ Ry Roe oo deo Rigi 
then (U;,,) and (Vj,,) both satisfy the linear recurrence: 
Anson = Ri Xn+2n-1 — RoXn+2k-2 +... — RorXn. (7.20) 


For example, when k = 2, we have R; = P,, Ro = Po + 2Q, R3 = QP, 
R4 = Qo and when k = 3 we have Ry = Py, Ro = P» + 3Q, Rz = P3 + 2P,Q, 
Ra = QRo, Rs = Q? Ri, Re = Q?. 

Also, as shown in Sect. 10.1 of [15], the discriminant D of F(a) can be 
evaluated as 

D = EA?QRERD, (7.21) 


where F is the integer f(2VQ) f(—2VQ) = (p] —4Q) (p3—4Q) ... (p§ —4Q) = 
TL, (a: — 6;)?. In the case of k = 2, we get E = (P2 + 4Q)? — 4QP?. 

If k = 1, we have Up» = U,(Pi,Q) and Von = Vn(Pi,Q), and we have 
already seen that when k = 2, P, = 0, P2 = R, we get Lehmer’s sequences 
from (Uj) and (Vj,,), as described above. In [14] (see also Chap. 10 of [11]), 
Williams showed that these generalized Lehmer sequences have many of the 
features possessed by the Lucas sequences. Given the definition of (Uj,,,) and 
(Vin) and (7.19), it is not surprising that we can derive several identities 
involving these sequences; indeed, there are addition formulas, multiplication 
formulas, and techniques for computing remote values of terms of these se- 
quences. In particular, we mention that when computing faraway terms of 
these sequences modulo some N such that gcd(N,Q) = 1, it is somewhat 
more convenient to work with the (Vj) sequences rather than the (Uj) 
sequences. We do this by analogy to the Lucas sequence case by defining 
Win = Q-"Vi2n (mod N) (i = 0,1,2,...,4 — 1) and working with the 
(Win) sequences modulo N. From (10.2.5) of [15], we can produce the addi- 
tion formula: 


k-1 k-1 


Wm+tn = = =S°( WinWinZni+i) —_ Wrn—m (mod N), 
i=0 j=0 


192 7 Linear Recurrence and Generalizations 


where the sequences (Z;,,,) for i = 0,1,...,4 — 1 are the impulse sequences 
with characteristic polynomial f(a). We also have the initial values: 


Wi.o — 26i,0; QWi1 = Zi2 = 2Q6i,0 (mod N) (a = 0, 1, seey k- Ly. 
If for any positive integer 7, we define the k-tuple W; to be 
{Wo,5,W1j,---; We—-1,5}, 


we see that, given any pair of these k-tuples W,, and W,,4; modulo N, the 
formulas (10.2.15) and (10.2.16) of [15] can be used to compute Wa, Wan+1, 
and Wn+42 (mod N) in O(k?) addition and modular multiplication oper- 
ations. It follows that the ideas of Sect.3.3 can then be used to compute 
the remote k-tuple W; (mod N) in O(k? logt) modular multiplication and 
addition operations modulo N. 


It needn’t be the case that any of (Uin) or (Vin), 1 = 0,1,...,k —1, 
is a divisibility sequence, but if we put A, = gcd(Uon,Uin,...,Uk-1n) 
and, when k > 1, C, = gcd(Vin,Van,---;Ve—-1n), then both A, and C,, 
are divisibility sequences. Furthermore, if m is a positive integer such that 
gcd(m,Q) = 1, the sequence (A,,) has a rank of appearance modulo m, 
which we will denote here by w = w(m). In [15] it is shown that if m | An 
for some n > 0, then w | n. It is possible to develop laws of repetition 
and appearance for the generalized Lehmer sequences; it is even possible 
to extend Lucas’ Fundamental theorem to these sequences and to develop 
primality tests analogous to those of Lucas. All of this is described in some 
detail in Chaps. 10-12 of [15]. 


7.4 Primality 


As we have seen, Lucas was motivated in his search for generalizations of his 
sequences because of his belief that they might provide a means of discovering 
new properties of prime numbers. In particular, he was interested in primality 
testing, and we know he was attracted to the problem of proving the primality 
of numbers of the form 

N = Ap" +4, (7.22) 


where p is an odd prime, 2 | A, p{ A, y € {1,—1}, and A < p”. Thus, it seems 
appropriate here to examine the problem of proving N a prime by using the 
generalized Lehmer sequences. 


Let s = (p—1)/2 and q be any prime such that g = 1 (mod p), g{ N and 
N(@-\)/P 4 1 (mod q). By Theorem 11.3.6 of [15], we see that a necessary 
and sufficient test for the primality of N, involving the Lucas sequences and 
similar to the p = 3 case of Theorem 56, can be derived when we have a priori 
some integer R such that G(R) = 0 (mod N). Indeed, if we can find any 
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Lucas sequence (U,,) such that (D|N) = y and N { Uy, then it is an easy 
matter to find some R such that either N | G,(R) or N is not a prime. For 
most values of NV, such a Lucas sequence is easy to find, but in some cases this 
problem can be difficult. In what follows we will show how the generalized 
Lehmer sequences can be used in the development of necessary and sufficient 
tests for the primality of numbers N, given by (7.22), which dispense entirely 
with the need to find R. We stress here that these tests are only of theoretic 
interest, as the use of a result like Theorem 54 or Theorem 57 is almost always 
adequate to establish the primality of NV. 

In Sect.11.1 of [15], it is shown how to compute s integers P(i, p,q) 
(@ = 1,2,...,s) such that if we put k = s and P,; = P(i,p,q) in (7.3), the 
corresponding sequences (A,,), (C,) associated with the generalized Lehmer 
sequences for this particular f(a) with Q = q?~? have some special proper- 
ties. In order to discuss these, we first need to define the function @(m). 

Let m be any odd positive integer such that p{m. If we let v (= v(m)) be 
the index to which m belongs modulo p, we define 0(m) = (m“ + (—1)”)/p, 
where yp. = v when v is odd and yp = v/2, otherwise. Notice that since v | p—1, 
we have 2p0(m) | m?~! — 1. The following is a very slight modification of 
Theorem 11.2.3 in [15]. 


Theorem 113 Let the sequences (An) and (Cy) be those obtained from the 
generalized Lehmer sequences with k = s, P; = P(i,p,q) ( =1,2,...,8) and 
Q =q?~?. Suppose r (4 p) is any prime such that r { qA and put 0 = O(r). If 
r(q-))/P = 1 (mod q), then r | Ag and r | Co; if r4-Y/P #1 (mod q), then 
r|Apo andr |Cyo, but rf AneCno when pt n. 


(We have replaced D(F’) or D by A in this version, but if we refer to the 
proof of Theorem 10.4.4 in [15], we see that the result is certainly true when 
\e;| = 1 for all i < A. However, it is shown in the proof of case 1 of Theorem 
10.3.10 that ¢; = —1, 0 or 1; if e; = 0, we have p | r, which is impossible. 
Thus, by (7.21) we can replace D by gA.) 

If we let N be of the form (7.22), we can obtain a somewhat more general 
version of Theorem 11.2.6 of [15] when (A,,) is defined as in Theorem 113. 


Theorem 114 Let N be of the form (7.22) and let (A,) be defined as in 
Theorem 113. Put 6 =0(N) =N—vy= Ap”. If gcd(N,pqgA) =1, N | Apo, 
and N { Ag, then there must exist a prime divisor r of N such that 


r?-l=1 (mod p”). 


Proof If N | Apo and N { Ag, there must exist a prime divisor r of N such 
that r°||N (b > 1) and r? { Ag. It follows that w(r’) | p@ and w(r®) { 6; 
hence, p” | w(r’). By the law of repetition for (A,) (Theorem 10.3.17 of 
[15]), we know that w(r®) | r™w(r) for some m > 0. Thus, we get p” | w(r) 
and therefore p” | p@(r) by Theorem 113. It follows from the definition of 
O(r) that 


r?-t=1 (mod p”). 
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By appealing to results in Sect. 10.4 of [15], it is also possible to prove the 
following theorem. 


Theorem 115 Let N be of the form (7.22) and let (C,) be defined as in 
Theorem 113. Put 0 = 60(N) = N—vy = Ap”. If gcd(N, pqA) = 1, N | Cpo, 
and N {Cyg, then there must exist a prime divisor r of N such that 


r?-l1=1 (mod p”). 
These results lead us to a study of the congruence 
z?-'=1 (mod p™) (7.23) 


for a given prime p and positive integer m. Let h be any primitive root of p. 
Define gi (p,m) = h?(™ (mod p™), where p(m) = p™! and put gi(p,m) = 
gi(p,m)* (mod p™), for i = 1,2,...,p—1. Since (p—1)p(m) = y(p™), we ob- 
serve that gi(p,m)?~' = 1 (mod p™); also, if gi(p,m) = 9;(p,m) (mod p”™) 
for 1 < i,j <p—1, then hU-YP(™ = 1 (mod p™), which means that p— 1 
must divide i — j; given the range of i and 7, we can only have i = 7. Thus, 
gi(p,m) for i=1,2,...,p—1 are the p— 1 distinct roots of (7.23). Further- 
more, gp—i(p,m) = 1 (mod p™); thus, if N > p” and g;(p,n) | N for some i 
such that 1 <i < p—1, then N must be composite. 

We may assume with no loss of generality that 0 < gj;(p,m) < p™ for 
i = 1,2,...,p — 1. We also notice that both g;(p,m) and p™ — gi(p,m) 
are distinct roots of (7.23); hence, there are exactly s values of the g;(p,m) 
(¢@ = 1,2,...,p — 1) which are odd. Thus, if N satisfies the conditions of 
Theorem 114, there exists a prime divisor r of N such that r = g;(p,n) +tp” 
for some t > 0 and some 7 such that 1 <i < p—1. We next prove a result 
similar to Lemma 20. 


Lemma 33 Suppose that N is given by (7.22), where n > 1 and gi(p,n) +N 
for any i such that gi(p,n) is odd and 1 <i<p-—1. If N has at least one 
prime divisor r of the form r = g;(p,n)+tp” for some t > 0 and some i such 
that 1<i<p-—1, then N is a prime. 


Proof We proceed as in the proof of Lemma 20. Suppose N is composite 
and write N = rT, where T > 1. We get rT = y (mod p”) and therefore 
gi(p,n)T = y7 (mod p”). Since 


1= "1 = (gi(p,n)T)?* =T?* (mod p”), 


we must have T = gj;(p,n) (mod p”) for some j such that 1 < 7 < p—1. 
Hence, T = vp" + g;(p,n), where v > 0. By the conditions of the theorem, 
we must have t,v > 1; however, this means that N > p?”, which contradicts 
A <p”. 


We can now combine the results of Theorem 114 and Lemma 33 to obtain 
a necessary and sufficient test for the primality of N given by (7.22). 
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Theorem 116 Suppose that N is given by (7.22), where n > 1 and gcd(N, 
qA) = 1. Suppose further that N(7-'/P £1 (mod q) and that g;(p,n) + N for 
any i such that g;(p,n) is odd and1 <i <p—1. If (An) is defined as in 
Theorem 118 andm =(N—v7)/p, then N is a prime if and only if N | Apm, 
and N t Am. 


Proof If N is a prime, then N | Apm, and N { Am by Theorem 113. If 
N | Apm, and N { Am, then N must be a prime by Theorem 114 and 
Lemma 33. 


We remark here that we can use Theorem 113, Theorem 115, and Lemma 33 
to prove a result analogous to Theorem 116 for the (C,,) sequence. 


Theorem 117 Suppose that N is given by (7.22), where n > 1 and gcd(N, 
qA) = 1. Suppose further that N(@-))/P # 1 (mod q) and that g;(p,n) { N 
for any % such that g;(p,n) is odd and1<i<p—1. If (Cy) is defined as in 
Theorem 1138 and m= (N—7)/p, then N is a prime if and only if N | Com, 
and N {Cmp. 


Theorem 116 is a necessary and sufficient analogue, using the sequence 
(A,) instead of (U,,), of the test given as Theorem 54. It is even possible to 
produce a test, involving the (W;,,) sequence modulo N, which is a necessary 
and sufficient version of the test given as Theorem 57. 


Theorem 118 Suppose N is given by (7.22), where n > 1 and gcd(N, gA) = 
1. Suppose further that N-))/P 4 1 (mod q) and that gi(p,n) + N for any 
i such that g;(p,n) is odd andl <i<p—1. Ifm=(N—y)/p, then N is 
prime if and only if 


Yis(Wom,: Wijym,-++;Ws—1m) = 9 (mod N) (i =0,1,...,s—1). 


Here the Y;,n(@0,%1,...,-1) ( =0,1,...,k—1) are k polynomials, each 
in k variables and of degree n, defined by 


Gr(aotaip+22p?+...+2~-1p"~') = Yon+YinptYanpe?+.--+Ye-1np™ | 


for all the roots of f(x). For example, Y;,0(o,%1,---,%k-1) = 61,0, Yi,1(“o, 41, 
..+,0p-1) = &; + 6; 9. Also, if k = n = 1, we have the single polynomial 
Yo. = to +1; if k = n = 2, we have the two polynomials Yo2(xo, 21) = 
x2 — Pox? +20 — 1, Yi,2(x0, 21) = Prx? + 2x9x1 + 21. Notice that 


Yi,2(2o, £1) = uy(P, 21 + 2X0 + 1), 
AYo,2(Xo, L1) = (P21 + 2x0 + 1)? _ 2Pix1 (Pia + 2x0 + 1) + Az? —5. 
(7.24) 


In general, for any fixed n > 1, we can compute Yj» = Yin(®o,%1,---,2k—1) 
(¢ =0,1,...,4—1) recursively by making use of the formula: 
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k-1k-1 


Yn = y y LiYitZni+j — Yn-1- 
i=0 j=0 


This formula can be easily derived from the identity Gi4i(%) = xG)(x) — 
Gi_1(«) and (7.8). The (Zp,;) sequences here are, as in the previous section, 
the impulse sequences with characteristic polynomial f(x). When n is large, 
more efficient methods of computing Y;,, (i = 0,1,...,—1) can be deduced 
from (4.18). 

There are many other methods for computing the Y;,5(Wo,m,---; Ws—1,m) 
in Theorem 118; for example, another technique can be derived as follows. 
For any integer n > 0, we simply put Win = Vin/Q” (i =0,1,2,...,4-—1) 
and define 


s—1 
Xin => Winb? (7.25) 
h=0 


by (7.18) and the second formula of (7.19) we see that we can write Xj, = 
Con(pi/ VQ) for the C;,(x) defined by (4.7). By (4.13) we get: 


s-1 
C(Xin) = Cajn(0i/-/Q) = — Wain? (7.26) 
h=0 
From (4.7) and (7.25), we find that 
8 s s—l 
Gs(Xin) = 9 C;(Xin) +1= 55 (>) Waygno?) +1 
j=l j=l h=0 


It follows from the definition of the Y;,, (¢ = 0,1,...,s — 1) that we have 


Yi,s(Wo,m, Wi,m,---,Ws-1,m) = S- Wi,jm + 6i0 (mod N). 


j=1 


We see here that Theorem 118 is a generalization for any odd p of Theorem 56; 
for, in the case of p = 3 we have s = 1,m = (N—7)/3, Yo.1(Wom) = Wom+l, 
Wom = Q-™Vo,2m (mod N), and Voom = Vom(Pi,Q), where Q = q and 
P, = P(1,3,q) =t. For the case of p= 5 (i =0,1,...,k —1), we have s = 2. 
Put P; = P(i,5,q) (i = 1,2) and Q = q?, where gq (= 1 (mod 5)) and r are 
primes such that ged(r,5qA) = 1 and r-)/5 41 (mod q). By Theorem 113 
we cannot have r | Wj, form = (N —+)/5. Thus, in view of (7.24), ap=5 
version of Theorem 56 can be given as follows. 


Theorem 119 Suppose that N is given by (7.22), wheren > 1, p = 5 
and gcd(N,qA) = 1. Suppose further that N“@—-)/> ¥ 1 (mod q) and that 
gi(5,n) { N for anyi such that g;(5,n) is odd and1 <i < 3. Ifm=(N—7)/5, 
then N is a prime if and only if 
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2Wom+PiWim =—1 (mod N) and AW? =5 (mod N). 


We conclude this section by emphasizing that, just as in the case of The- 
orem 53, these results, as tests for primality, are not effective in that they do 
not completely specify for any given N how to find a suitable value of g. This 
obstruction is discussed at some length in Sect. 16.4 of [15], and it seems that 
in most cases it is not a difficult problem, in practice, to find an appropriate q 
for most values of NV. Certainly, we can easily specify an infinitude of possible 
values of N for which it is easy to find a satisfactory value for q. 
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Chapter 8 


Divisibility Sequences and Further 
Generalizations 


I do not know what I may appear to the world, but to myself I seem to have been 
only like a boy playing on the seashore, and diverting myself in now and then finding 
a smoother pebble or a prettier shell than ordinary, whilst the great ocean of truth 
lay all undiscovered before me. (Isaac Newton) 


Abstract In this chapter we consider, starting with some early observations 
of Lucas, the problem of generalizing the Lucas sequences (U,,) and (V,;,), by 
making use of certain divisibility sequences. We provide a brief discussion 
of Ward’s elliptic divisibility sequence and its remarkable attributes. This 
is followed by a discussion of the properties of divisibility sequences which 
satisfy linear recurrences. We use these observations and the basic properties 
of the Lucas sequences to derive some extended Lucas sequences and list their 
properties in certain special cases. These sequences have features very much 
in common with those of (U;,) and (V,,). Finally, we discuss odd and even 
extensions of the Lucas sequences. 


8.1 Another Suggestion of Lucas for Generalizing His 
Sequences 


As stated in Sect.6.1, Lucas mentioned three possible directions that might 
be followed in generalizing his functions. We have already examined one of 
these and an instance of another (Roettger’s sequences). In this section and 
the next, we will discuss his third proposal. Later, in the following sections, 
we will provide a more general discussion of sequences like those given by 
(6.1). 

In Sect. IX of [16], Lucas states: 


We will complete this section with the proof of formulas of extreme impor- 
tance, because these will serve later as a basis for the theory of the numerical 
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functions of double period, derived from the consideration of the symmetric 
functions of the roots of the third and fourth degree with rational coefficients. 
It is not exactly clear what Lucas meant by “numerical functions of double 
period,” because the most obvious meaning would be a function F' such that 
F : Z > Z@ and for any fixed m, we have F(n + pi) = F(n) (mod m) 
and F(n + pz) = F(n) (mod m) for all sufficiently large n and fixed (but 
dependent on m) integral periods p; and p2. However, it is easy to see that 
because there exist integers x and y such that p = xp, + ype, then F is simply 
numerically periodic modulo m for the period p where p = gcd(p1, po). 


Lucas goes on to show that 
U2Um—1Um+1 — UU, {Ona = OU ee (8.1) 
and 
VV aed — Vwi — PO" Ug an (8.2) 


Actually, he gave an incorrect version of (8.2) which was later corrected 
by Bell [4]. On putting m =n+1 and m=n + 2, in (8.1), he produced the 
two formulas: 


3 3 -1 
UU n+2 - UR 41 Un—1 = Q” UV oan+1, and 


2 _ 7y2 —gmn-1 (8.3) 
UU n41U n+3 U 42Un—1U 41 = Q UU an +2- 


n 


He then states: 


The formulas [(8.1)] and [(8.3)] belong to the theory of elliptic functions 
and more especially to the functions which Jacobi denoted by the symbols 0 
and H. (In Lucas [17], he added the functions 0; and H; to this remark.) 


It seems, then, that Lucas, having observed the connection between his 
linear recurrence sequences (of degree 2) (U,,) and (V,,) to the singly periodic 
circular functions (see Sect. 4.1), was attempting to extend these sequences 
by appealing to a possible connection between some (unspecified) recurrence 
sequences (of order 3 or 4?) and doubly periodic elliptic functions. This was 
all he seems to have written concerning this idea until the remarks in his 
1891 circular, quoted in Sect. 6.1. 


These remarks provide us with some idea of where Lucas intended to go, 
but as they were published in the year of his death, he was unable to do 
anything further. The reference to Moutard is particularly revealing because 
Moutard, in investigating the Poncelet polygons, developed some results con- 
cerning a certain function A,,, which satisfies 


Am-nAm+n = A? Am—1Am+1 -— A? An—1An41; (8.4) 


the same equation (with Q = 1) as that satisfied by U,, in (8.1). Furthermore, 
if we are given values for a, b, c and Ag = 0, Ay = 1, Ag = 2abc, Az = 
agby + boca + ace, A, = 4abcazbece, where az = a7b? — b?c? + a?c?, bo = 
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a?b? + b?c? — a2c?, cp = —a7b? + b?c? + a?c?, then Moutard’s A, possesses 
the following features: 

e A,, is a symmetric polynomial in a, b, c, of degree n? — 1, over the 
integers; 

e (A,,) seems to be a divisibility sequence when a, b, c are the roots of 
a cubic polynomial with integer coefficients (Moutard did not actually prove 
that (A,,) is a divisibility sequence, but provided some results that strongly 
suggest this.); 

e A,, is related to the Jacobi H function; 

e Ifa=b, then A, = Q?-/2U, (P,Q), when P = 2c, Q = b?. 

Given these observations, it is easy to see how Lucas might have written 

the above comments. For more concerning this, see Sect. 17.4 of [35], where 
it is explained why Lucas might have thought that there could be solutions 
of (8.4) which were also linear recurrence sequences of order three or four. 


8.2 Elliptic Divisibility Sequences 


Although Lucas seems to have conceived the idea explained in the previous 
section as early as the mid to late 1870s, he appeared to make no progress on 
it beyond what is reported above. The next individual to try to find a con- 
nection between symmetric functions of the roots of a polynomial, through 
the intermediary of elliptic functions, to Lucas’ sequences was E.T. Bell in 
[4]. Bell succeeded in clarifying some of Lucas’ ideas, particularly those con- 
cerning elliptic functions, but was unable to make any substantial further 
progress, and as we have pointed out in Sect. 7.3 seems to have been rather 
pessimistic about any future advancement. Indeed, in March 23, 1930, in 
response to a letter written by Duncan Harkin, he [5] wrote: 


. you ask about a feasible generalization through Elliptic Functions of 
the Functions of Lucas. If I knew how to do this, I [s]hould be far from telling 
you, as I have tried my darndest to make some real progress on it myself for 
the past twentyeight [ sic] years. It is a tough nut of the first order. Whoever 
cracks it will make a contribution to the theory of numbers on a par with 
anything Fermat did. Go to it! 


It was Bell’s Ph. D. student, Morgan Ward, who made enormous progress 
on this problem. Although Ward’s 1928 Ph. D. dissertation, entitled Founda- 
tions of General Arithmetic, is mainly a work on formal logic, it seems that 
he must have discussed Lucas’ work with Bell because much of Ward’s sub- 
sequent work seems to have been motivated by the problem of generalizing 
Lucas’ sequences. For a brief summary of Ward’s mathematical work, see the 
paper of Lehmer [14]. 


202 8 Divisibility Generalizations 


Ward’s most important contribution to the Lucas problem was his memoir 
[34], which he published in the same journal in which Lucas published his 
memoir. In this work, he began with the recursion formula: 


Wm—nWm+n _ W2Wm-1Wm41 = W2.Wr-1Wrsi, (8.5) 


which is essentially the formula that Lucas regarded as being so important. 
Indeed, we observe that if W, = Qr("-)/?U,(P,Q), then by (8.1) (Wn) 
satisfies (8.5). Somewhat later in [34] Ward pointed out that if Y, = c"™W,, 
where c is any nonzero constant and r(n) = n? — 1, then the sequence (Y,,) 
also satisfies (8.5); he called these sequences (W,,) and (Y;,) equivalent. He 
derived a number of important results concerning sequences satisfying (8.5) 
and supplemented this work with further deductions in [34]. Notice that if we 
put n = m and assume that W2, 4 0 for some n, we see that Wo = 0; also, if 
we put n = 1, we get W; = +1. Ward confined his discussion of the solutions 
of (8.5) to those with initial conditions Wp = 0 and the normalized W, = 1. 
On putting m =n+1 and m=n-+ 2, in (8.5), it is easy to see by induction 
that (W,,) is completely determined, once values have been selected for W2, 
W3, W4. 


In what follows we summarize some of Ward’s findings: 


a The only sequences (W,,) satisfying (8.5) that are of arithmetic in- 
terest are those for which Wp = 0, W; = 1 and W2W3 4 0. Such (W,,) are 
said to be nondegenerate. 

um The sequence (W,,) is a divisibility sequence (called an elliptic divis- 
ibility sequence or EDS) if and only if Wo, W3, W4 are integers and W2 | W4. 

a If (W,,) is an EDS and p is any prime, then p must divide some W,, 
(r > 0). 

s If the prime p divides W,, (p > 1) in an EDS (W,,), but p { W, 
for any aliquot divisor r of p, eal , a rank of appearance of p in (W,,). If 
pt gcd(W3, W4), then p < 2p +1. 

a If is the least rank of appearance of a prime p in (W,,) and p{ W,41, 
then p | W,, if and only if p | n 

s Ifpis the least rank of appearance of a prime p in (W,,) and p | W,41, 
then p < 3 and p| W, for all n> p. 

= Suppose that p (> 3) is the rank of appearance of a prime p in the 
EDS (W,,). If m = v,(W,), the rank of apparition of p” in (W,,) is p or 
p"—™ p, according asn <morn>m. 

a If (W,,) is an EDS in which W3 and W4 are coprime, then gcd(W, 

Wr) = |W, gcd(m,n) )I- 
Ward defined the discriminant A of (W,,) to be the quantity given by the 
somewhat peculiar-looking expression: 


A =WiW2° — W3W,? + 3W7W,° — 20W.W3Ws 
+ 3W2W8 + 16WSWes + 8W2W3Wwe + We. 
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He called (W,,) singular if its corresponding discriminant is zero, and 
showed in Theorems 19.1 and 22.1 of [34] that any singular (W,,) must be 
equivalent to the Lucas sequences (U;,(P,1)) and that this will not be the 
case when A # 0. An example of a nonsingular (W,,) is 


0,1,1,—1, 1,2, -1, -3, -5, 7, —4, —23, 29, 59, 129, -314, —65, 1529, 
3689, 8209, —16264, 83313, 113689, —620297, 2382785, 7869898, (8.6) 
7001471, —126742987, —398035821, 1687054711, --- 


This sequence occurs as A006769 in the Online Encyclopedia of Integer 
Sequences (OEIS) [28]. 


All of this suggests that the EDSs possess properties that Lucas would 
have found very desirable, except for one important difficulty: in Sect. IX of 
[34], Ward provides several arguments which suggest that it would be most 
unlikely for (W,,) to be a linear recurrence sequence (X,,), of order k = 3 or 
4. This is mainly because the arithmetic behavior of (X,,) is rather different 
from that of (W,,); in particular, their possible ranks of appearance seemed 
to be different. He also mentioned that two consecutive terms of a linear 
recurrence sequence with k > 2 can be divisible by a prime p without having 
almost all terms of the sequence being divisible by p, but that could not be 
so for (W,,). Thus, although Lucas was looking in what would subsequently 
turn out to be a very productive direction, his apparent belief that solutions 
of (8.5) should be linear recurring sequences of order three or four seemed 
to ensure his lack of further progress. 


To continue with our discussion of elliptic divisibility sequences, we point 
out that Ward showed, using different language, that any nondegenerate EDS 
is associated in a specific way with a pair (E,P) made up of an elliptic 
curve £/Q and a point P € E(Q). This correspondence can be given as 
Wr = Un(P), where (x,y) is the n-division polynomial of E. For a given 
W2, W3, Wa, the associated elliptic curve is given explicitly in [34] and, by 
using more modern notation, in Sect. 10.3 of [8] or the appendix of Silverman 
and Stephens [26]. For (8.6), the associated elliptic curve £/Q is modeled by 
y? +y = 2° — x and P is (0,0). It may appear on a cursory glance that the 
terms in (8.6) are not growing very quickly, but when P is non-torsion, which 
is usually the case, it can be shown that the growth rate of the corresponding 
(non-singular) (W,,) is given by 


lim log|W,,|/n? = h > 0, 
nm—-+co 
where h is a constant, called the canonical height of P. Since this growth 


rate is much faster than that of any linear recurrence sequence (X,,), we see 
another reason why such a (W,,) cannot be a linear recurrence. 
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Only occasional attention was paid to elliptic divisibility sequences until 
about 2000, when they were recognized as being members of a class of non- 
linear recurrences that are more easily subject to analysis than most such 
sequences. This manageability comes about from the aforementioned close 
connection between EDSs and elliptic curves. As a more detailed study of 
these objects would take us well beyond the scope of this book, the inter- 
ested reader is referred to Chap. 10 of [8] for a good synopsis of information 
concerning EDSs and for several references. Further material can be found 
in the theses of Shipsey [25], Swart [30], and de Looij [15] and the paper of 
Silverman and Stange [27]. It should be mentioned here that (W,,) is a par- 
ticular Somos-4 sequence. For k > 4, a Somos-k sequence (S,,) is generated 
by the quadratic recurrence relation: 


[k/2| 
SntkSn = S- Aj Sntk—jOn45; 
j=l 
where aj, 7 =1,2,...,|k/2|, are constants and the initial values So,51,..., 


S,—-1 are given. We can easily deduce from (8.5) that 
WnrtaWn = WeWri3sWnii — W,W3Wei19; 


thus (W;,,) is a Somos-4 sequence for ay = W3?, a2 = W,W3. For more 
information and references concerning Somos-4 sequences, see the paper of 
Hone and Swart [10]. 


We conclude this section by mentioning that Stange [29] has applied EDSs 
and their higher rank generalizations, called elliptic nets, to cryptography. 


8.3 Linear Divisibility Sequences 


We have seen in the previous section that Lucas’ idea of moving in the direc- 
tion of elliptic divisibility sequences could have been very productive for him, 
but, unfortunately, he was laboring under a misconception which prevented 
further advancement. Also, the state of development of mathematics at the 
time would likely have precluded his making any significant discoveries, even 
if he were not burdened by his fixation on linear recurrences. It seems, on 
perusing what little he did say concerning the problem of generalization, that 
divisibility sequences would play a prominent role. Certainly, the sequence 
given by (6.1) and the elliptic sequences both possess this property. In view 
of the importance of this feature in his development of the arithmetic prop- 
erties of the Lucas sequence (U;,,), this is not surprising. It seems that he 
might have been searching for linear recurrence sequences which were also 
divisibility sequences. Indeed, Ward [32], [33] coined the term “Lucasian” for 
such sequences. In this section we will briefly investigate them. 
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Let k& be a positive integer. As in Sect. 7.1, we let the sequence (X,) be 
determined by an initial set of integer values Xo, X1, X2,...,Xz—1 and the 
linear recurrence: 


Xntk = PrXnte—1 — PaXnpe-o2t--.+(-1)*' Pe Xn, (8.7) 


where P,, Po,...,P, are given fixed integers. The sequence (X,,) is said to 
be a linear divisibility sequence (LDS) or a Lucasian sequence of order k if 
Xn | Xmn whenever m,n > 0. In what follows we will always use (X,,) to 
denote a linear divisibility sequence of order k. 


In connection with this study, it will be useful to discuss what Koshkin [11] 
has called a divisibility polynomial. We say that a polynomial f(x) with inte- 
ger coefficients is a divisibility polynomial if f(2™) divides f(a”) whenever 
m|n. Here m and n denote positive integers. It is easy to see that for f(x) 
to be a divisibility polynomial, it is necessary and sufficient that f(a) divide 
f(x") for all positive integers n. As examples of divisibility polynomials, we 
mention f(z) = «—1or f(x) = (*+1)(2™ — 1) with positive odd m. In [11] 
the author completely characterizes the divisibility polynomials and uses his 
results to construct some interesting LDSs. For example, he points out that 
if we select f(x) = (a + 1)(x* — 1) and put z = a/B, where a+ 8 = 1 and 
a8 = —1, we find that (X,,) with X, = Ln) Fsn/F, for any fixed odd positive 
integer s, is a LDS of order four. Similarly, for f(x) = (x +1)(a? +1)(#? — 1) 
and x = a/G, we can see that if X, = ¢LnLonF3n, then (X,,) is a LDS of 
order six. 


We next present some simple results, given in Hall[9], concerning any LDS 
(X,). 


Theorem 120 /f (X,,) is a linear divisibility sequence of order k and some 
nonzero X, (r > 0) has a factor m relatively prime to P,, then m | Xo. 


Proof By results in Sect. 7.1, we know that since gcd(m, P,) = 1, we have 
Xn = Xn (mod m) for all n > 0, where 7 is the minimal period length of 
(X,) modulo m. Hence, X, = Xo (mod m). It also follows by induction on h 
that X,, =X, (mod m) for all h > 0. Thus, X,,; = Xo (mod m), but since 
(X,,) is a divisibility sequence, we must have X, | X,,, which means that we 
must have m | Xo. 


Corollary 121 If Xo # 0, then any prime which divides some nonzero term 
of (X,) must be a divisor of gcd(Xo, Px). 


Thus, when Xp 4 0, there can only be a finite number of primes which 
can divide all of the terms of (X,,). However, when Xo = 0, then any prime 
which does not divide P;, will occur as a divisor of infinitely many terms of 
(X,,). It is for this reason that we will assume that Xo = 0 in (X,,); also, since 
(X,) is a divisibility sequence, we must have X, | X,, for all n > 0. Thus, if 
X, = 0, then X, = 0 for all n > 0, and the resulting sequence is not very 
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interesting. We exclude such sequences from consideration. When X, 4 0, 
then (X,,/Xj1) is a linear divisibility sequence of order k; hence, there is no 
loss of generality in assuming that X; = 1. Such a divisibility sequence is 
said to be normalized. 


We can now generalize the idea of a primitive divisor introduced in 
Sect. 4.4. We will say that a prime p, which does not divide Px, is a primitive 
divisor of Xp, a term of (X,,), if p divides X,, but p{ X, for0<r<h. We 
have the following theorem. 


Theorem 122 If p is a primitive divisor of Xp, then h | r(p). 


Proof We let d = gcd(h,7), where we will denote r(p) by 7 here. There must 
exist positive integers x,y such that ha — ry = d. Since (X,,) is a divisibility 
sequence, we must have p| Xz, . Also, by the definition of 7, we must have 
0 = Xan = Xen—ry = Xa (mod p). As p is a primitive divisor of X;, we see 
that we can only have d > h; hence, d = h and d |r. 


Further properties of LDSs where the discriminant of the characteristic 
polynomial is not zero were proved in Ward [31], but little progress was 
made on the problem of determining the LDSs (if any exist) for a given char- 
acteristic polynomial. In what follows we will assume that the LDS (X,,) is 
nondegenerate; that is, no quotient of any two of the roots of its characteristic 
polynomial is a root of unity. This means that the polynomial cannot have 
two roots to a common power that are equal. 


Now suppose f(x) and g(x) are given by 


f(z) =a" — Pyo®* + Pon®-? —...+ (-1)*P,, 

g(a) = 2" — Qin"! + Qoa”? — ...+ (-1)"Qn, 
where P; (i = 1,2,...,k) and Q; (i = 1,2,...,h) are integers and the dis- 
criminants of f(x) and g(x) are both nonzero. Let pj, p2,..., Px be the roots 


of f(a) and o1,02,...,0, the roots of g(x). Recall that the resultant of f 
and g, written R(f,g), satisfies 


k 


R(f,9) = [J 9(o)- 


i=l 


If we let f,(a) be the polynomial with roots p}, py,..., p% and gn(x) the 
polynomial with roots of,o3,...,07. We can put R, = R(fn,gn). If all 
the roots 1, P2,---, Pk; 01,02;---,0p are distinct, we can define r, by rp, = 
R,/R,. Ward [32] remarked that when n > 1, ry, is an integer. Also (r,,) is a 
linear recurrence sequence which is also a divisibility sequence and is therefore 
a Lucasian sequence; Ward called (r,) an R-sequence. He was unable to find 
any Lucasian sequence which was not an R-sequence; indeed, he stated that 
“it appears probable that all Lucasian sequences may be exhibited as R- 
sequences or divisors of R-sequences.” This was pretty much the way matters 
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stood until 1990 when Bézivin, Petho, and van der Poorten [6] were able to 
prove Ward’s statement. 
We can write 7, above as 


k pt —o® 
rn =] [| -—*. (8.8) 
i=1j=1 ome 
In [6], the authors observe that even if all the roots p1, p2,.--, Pk; 01,02;+-+5Th 


are not distinct, by using reasoning similar to that in Sect. 2.1, on each term 
of the above product (8.8), the generalized R-sequence r,, can be put in the 
form: 


m= ny" TT (a? — 8P)/(a% — 6). (8.9) 


Here y, a;, 2; represent fixed nonzero algebraic numbers and m is some non- 
negative integer. By making use of deep results in the theory of exponential 
polynomials and the Hadamard quotient theorem, they were able to prove 
the following result. 


Theorem 123 /f (X,,) is a nondegenerate LDS, then there exists a resultant 
sequence (rn) of the form: 


Tr =n"™ [](er — B?)/(au — Bi), (8.10) 


such that for alln > 0, we must have Xy | Tn. 


Recently, by making use of the theory of generalized Vandermonde deter- 
minants, Barbero [3] was able to give an elementary proof of a more precise 
version of Theorem 123. We give below his result in the specific case where 
the characteristic polynomial of (X,,) has a nonzero discriminant, but his 
overall result is much more general than this. 


Theorem 124 If (X,,) is a nondegenerate LDS of degree k whose charac- 
teristic polynomial has roots p1,p2,.-.-,Pr, then for alln > 0, we must have 


Xn | Tn, where 
—_ Il pi — PF 
i<icj<k Pi PI 
Notice that r, here is the same as Lucas’ A(p?, p3,...,e%)/A(p1, p2,---, 
Pk) in (6.1). 
Theorem 124 was used by Abrate et al. [1] to prove the following pretty 
result. 


Theorem 125 /f (X,,) is a nondegenerate LDS of degree 4, then its charac- 
teristic polynomial must be of the form: 


at — Px? + (R+2Q)2? — PQxr + Q?, 


for some integers P, Q, R. 
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The above polynomial is the characteristic polynomial F(a) for the ex- 
tended Lehmer functions in Sect. 7.3 when k = 2 and R = Ps. This is not a 
coincidence because in this case, 2, = Vi,» and (2,) is a divisibility sequence. 
We will discuss this further in the next section. 

Unfortunately, Theorem 124 only provides us with a sequence whose terms 
must be divisible by those of (X,,), but does not give us a formulation for 
X,,. In [18] Oosterhout made the following interesting conjecture (Conjec- 
ture 3.5.6). 


Conjecture 126 Let (X,,) be any nondegenerate LDS and let 1, p2,..., Pk 
be the roots of its characteristic polynomial. Let I’ be the subgroup of the 
group of all nonzero algebraic numbers generated by the p;. Then, there 
exists some c € Q, elements , 1, 72,---,; Ym inI and divisibility polynomials 
P1;P2,--+,Pm Such that 


Xn = cA"pi(¥1')p2(y2) ---Pm(Ym)> 


for all n > 0. 


In the case that I is non-torsion, a sketch of the statement and a rather 
rough proof of this conjecture can be found in Sect. 5 of Everest and van der 
Poorten [7]. 

The difficulty with using this conjecture is that it is only a necessary 
condition on X,,. For any given value of k, it provides little information about 
what the values of A, 71, 7y2,---;Y%m Should be; nevertheless, Oosterhout was 
able to make use of it to characterize conditionally all the LDSs of order 2, 
3, and some of order 4. Recall, however, that we saw in Sect.6.2 that we 
still have no proof concerning the identity of the LDSs of order three; thus, 
this conjecture, if true, is likely very difficult to prove. If we put m = k, 
pi(x) =x2—-1 (4 =1,2,...,k) and insist that X, = 1, we get: 


k 


Xn =X" T] (1 P)/(d — ws 


i=1 


a sequence we will consider in the next section. 


8.4 Extended Lucas Sequences 


In Roettger et al. [23], it is argued that the Lucas sequences can be charac- 
terized by four basic properties: 


(1) There are two sequences of integers (when n > 0). 


(2) Both sequences satisfy the same linear recurrence relation. 
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(3) One of the two sequences is a divisibility sequence. 


(4) There are both addition and multiplication formulas for the terms of 

the sequences. These are identities respectively of the type Xmin = 

-+, Xmn = +--+, where the right-hand sides are polynomial expressions 
involving the mth and nth terms of the sequences in (1). 


Given what we do know about Lucas’ attempt to generalize his sequences, 
it seems that any sequences which he might have found acceptable as a proper 
generalization of U and V should possess these four properties. We remark 
that Roettger’s sequences in Sect. 6.6 do satisfy these four properties, but are 
there any others? 


One candidate is the Pierce sequences. In 1916 Pierce [19] produced two 
sequences with properties very similar to Lucas’ U and V. He denoted these 
by (A,,) and (S;,), where 


k k 
An =|] (1— pf?) and S, = ] [+ 97). 
i=1 i=1 
Here pj, P2,---, Px are the roots of a monic polynomial f(a) of degree k with 


integral coefficients. He was able to derive a number of features of these 
sequences which are very similar to those of Lucas’ (U;,) and (V,,). For exam- 
ple, he showed that (A,,) is a divisibility sequence. He also obtained several 
number theoretic results concerning these sequences, particularly when the 
degree of f(a) is three. Later, Lehmer [12] extended some of Pierce’s results, 
showing, among other things, that each satisfies a linear recurrence relation. 
Indeed, Lehmer [13] employed these functions in a test, which makes use 
of the factors of N? + N +1 to demonstrate the primality of N. Unfortu- 
nately, Pierce’s functions are difficult to compute, which means that using 
them is not very practical. (Lehmer explains this in detail on pages 50-60 in 
his unpublished 1968 notebook #11, which, together with all the rest of his 
notebooks, is now in the Bancroft Library at U.C. Berkeley.) While Pierce’s 
work certainly produced a kind of extension of the Lucas functions, it is very 
unlikely that Lucas was thinking along these lines, because nowhere in his 
work does he allude to anything like these sequences. Also, Pierce’s sequences 
do not become (U,,) and (V,,) when f(a) is of degree two. 


In [23] the authors, on considering Conjecture 126 and (8.9), generalized 
Pierce’s S,, and a normalized version of his A, by putting 


1-7! 
v =] —* and V, =" JJ +97), (8.11) 
i=1 % i=1 
where A and 7,72,---,Yx are simply constrained to be nonzero algebraic 


numbers such that V; 4 0 and both (U,,), (Vn) are sequences of integers 
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for n > 0. Notice that, just as in the case of the Lucas sequences, we have 
Vi = Uan/Un. 

Under the above constraints, it is easy to prove that if k = 1, P = A(y,+1), 
Q = »*71, then both P and Q must be integers because Vi = \(y + 1), 
Vo = A7(1 + 72), Us = A7(1 + y1 + 77) are all integers. In this case, we see 
that in (8.11) Un = Un(P,Q), Vn = Vn(P,Q), where a = A and B = Ay1. 
Thus, (8.11) represents a generalization of the Lucas functions. 

For any fixed value of k, conditions on A and 71,72,..-,Yx are derived 
in [23] to ensure that both (U,,) and (V,,) are sequences of integers which 
satisfy the same linear recurrence. For example, one of these conditions is 
that Q (= A?y172.--Yxn) be an integer. It is also shown that under these 
same conditions, (U,,) is a divisibility sequence. In the case that k = 2, we 
must have T;, 72, and Q integers, where 


Ti = A(1+91)(1+72), To = 7 (ya +42)(1 +7172) +27 9172 and Q = A?9172. 


Thus, if we put P, = 7; and Py + 2Q = T>, we must have P,, Py and Q 
integers. In this case we find that both (U,,) and (V,,) satisfy the fourth-order 
linear recurrence: 


Xnt4 = PiXn+3 = (P2 a 2Q)Xn42 + PiQXn+1 = Ox. 


with initial conditions Up = 0, U; = 1, Uz = P,, U3 = P? — Py -3Q, Wo = 4, 
Vi = Pi, Vo = P? — 2P) — 4Q, V3 = P,(P? — 3P2 — 3Q). Notice that the 
characteristic polynomial for (X,,) is the polynomial in Theorem 125. We 
have: 


Un = Vin and Vp =2Von + PiVins (8.12) 


where (Vo) and (Vi,,) are the generalized Lehmer sequences for k = 2 
in Sect. 7.3. The sequences (U,,) and (V;,,) were discussed in some detail by 
Williams and Guy in [37], [38], and further properties of them were developed 
in Roettger et al. [24]. Suffice it to say here that they possess the four proper- 
ties of the Lucas sequences mentioned at the beginning of this section. There 
is also a law of repetition, a law of appearance, and a Fundamental theorem 
analogous to those for the Lucas sequences. Thus, these are evidently the 
fourth-order analogues of the Lucas sequences. 


When k = 3, the situation is somewhat more complicated because the 
sequences satisfy a linear recurrence of order 8; this is given in Theorem 5.3 
of [23]. A difficulty, however, arises when we try to develop the addition and 
multiplication formulas because it becomes necessary to introduce a third 
sequence in order to do this; thus, the sequences when k = 3 cannot in 
general satisfy the four conditions given earlier. This, of course, is also likely 
the case for k > 3. Nevertheless, there is a special case when k = 3 which 
does allow us to require only two sequences; this is the case when 772773 = 1. 
Here we must have A an integer, denoted by R, and the resulting sequences 
(U,) and (V,, — 2R"”) satisfy the same linear recurrence of order six: 
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Xnag =51 Xn15 — (Sa + BR?) Xnsa + (Sp + 2R7S1)Xnis 


8.13 
— R?(S2+3R?)Xnie + R°StXn41 — R°Xa, me) 

where $1, S2, and S3 are integers such that 
S3 = RS} — 2RS2 —4R°. (8.14) 


In this case it turns out that p1, p2, ps, where p; = R(yi +7, ') (i= 1,2,3), 
are the roots of the cubic polynomial: 


xe = 51x + Sox = S3. 


If we put W,, = Vp —2R”, it is shown in [23] that (U,) and (W,,) also possess 
the four properties mentioned at the beginning of this section. Also, if we put 
Q = R?, P, = S,, Pp = Sz, P3 = S3, we see that (8.13) is the recurrence for 
the generalized Lehmer functions when & = 3. By using formulas in Sect. 7 
of [23], it can be shown that 


U, = UqU 1» + U3U 2, and Wr = 3Von + PiVin + (P? — 2P2)Van, (8.15) 


where Uz = P,; + 2R, U; = P? + RP, — Py) — 3R?. When k = 3 and P; = 
RP? — 2RP, — 4R°, this provides the connection similar to (8.12) (in the 
k = 2 case) between the (U,,) and (W,,) sequences, and the corresponding 
generalized Lehmer sequences. 


The arithmetic theory of the (U,) and (W,,) sequences defined here is 
very rich and would certainly have appealed to Lucas. Several arithmetic 
results concerning them can be found in [23] and in Roettger and Williams 
[20]. Indeed, it has been shown by Roettger and Williams [21] that special 
instances of these sequences can be used to produce a necessary and sufficient 
primality test for integers N such that N® — 1 is divisible by a large power 
of 7. Given some prime q (= 1 (mod 7)) such that N(@-)/7 41 (mod N), 
this test executes in O(log N) modular multiplications modulo N and is very 
similar to a p = 7 version of the general test of Theorem 118, but is more 
strictly analogous to the p = 5 test of Theorem 119. 


We also point out that if P, Q, R are arbitrary integers and we put S$; = 
PQ — 3R, Sz = P?R+Q? —5PQR+ 3R?, S3 = R(P?2Q? — 2Q? — 2P2R+ 
4PQR — R?), then $1, $2, $3 satisfy (8.14) and the corresponding (U;,) and 
(W,,) are the Roettger sequences discussed in Sect. 6.6. 


We have seen in this section that Lucas’ intuition was correct about the ex- 
istence of fourth-order extensions of his sequences. It seems that he was wrong 
about the existence of third-order analogues which satisfy a cubic recurrence, 
but there are sixth-order analogues, and the roots of cubic polynomials play 
a role in this theory, particularly for Roettger’s sequences. Of course, there 
exist further sequences for k > 3, but it seems very likely that Condition 
(1) will be violated. This, however, should not be regarded as a significant 
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obstacle to further advancement of the theory. Indeed, in the general case 
of k = 3, only a third function is required in dealing with linear recurrences 
of order eight. It would be of some interest to discover what further results 
might be derived for the sequences given by (8.11) when k > 3. 


8.5 Odd and Even Extensions of the Lucas Sequences 


Consider a linear recurring sequence (X,,) with characteristic polynomial 
F(x) of even degree 2k having zeros a;, 3; such that a;8; = Q (i =1,...,k). 
By analogy to similar definitions in the theory of functions, we say that 
(X,) is even if X_,» = X,/Q” and (X,,) is odd if X_, = —X,,/Q” for all 
integral values of n. For example, (Un(P,Q)) is odd and (V;,(P,Q)) is even 
by (2.7); also, by (10.2.3) of [36], the generalized Lehmer sequences (Uj,») 
(i = 1,2,...,k) are all odd and (V;,,) (¢ = 1,2,...,k) are all even. Hence, 
the fourth- order LDS (U,,) mentioned in the last section is even. Let D be 
the discriminant of F(a) and suppose that D # 0. It not very difficult to see 
that (X,,) is even if and only if 


Xn = c1(at + BY) + cola + BE) +... + celag + Be); 


and (X,,) is odd if and only if 


Xn = e1(ay! — BY) + c2(az — BE) +... + ce(az — Be). 


Here the c; (i = 1,2,...,) are constants determined by the initial conditions 
on (X,,). These results are proved in Sect. 2 of Williams and Guy [39]. 


The object of [39] was to find which of the linear divisibility sequences of 
order four are either even or odd. The discussion there was confined to LDSs 
with characteristic polynomial of the form 


F(a) = 2* — Px? + (R+2Q)2? — PQr+ Q?, (8.16) 


but as we have seen in Theorem 125, these are the only nondegenerate LDSs 
of order four. By (7.21), the discriminant of (8.16) is D = EA*Q?, where 
A= P?—4R and E=(R+4Q)" — 4QP?. 

It is by no means clear that any nondegenerate LDS of order four must 
be either even or odd; in fact, when P = 0, there exist fourth-order LDSs 
which are neither even nor odd. For example, consider P = 0, R= Q = -1; 
we get X, = F;, when n is even and X,, = L, when n is odd. It follows since 
X_m = Xm/Q™ when m is odd and X_», = —Xm/Q™ when m is even, that 
(X,) can be neither even nor odd; nevertheless, (X,,) is a nondegenerate 
fourth order divisibility sequence. However, it is also true in this case that 
we can represent X, as U,(5,1), which is an odd fourth-order divisibility 
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sequence for Q = 1. In general, when P = 0, then either (X,,) is odd for a 
given Q or, if not, it is when we replace Q by —Q. 

Note that if (X,,) is any odd fourth-order LDS, then it is completely de- 
termined once we know X2, because we must have X_,; = —1/Q, Xo = 0 
and X, = 1; in fact, it is easy to show that the terms in such a sequence 
will always be of the form X, = Upon + X2Ui, for the (&K = 2) generalized 
Lehmer functions with characteristic polynomial (8.16). However, this is only 
a necessary condition and is not sufficient. 


Let A = Su? and E = Gv’, where S, G, u, v are integers and § and G are 
both square-free. Let (X,,) be any non-null linear divisibility sequence with 
characteristic polynomial F(a) given by (8.16) with D # 0 and P 4 0. In 
[39] the following results are proved: 


e There is one and only one even (X,,) for any given F(a); 
e IfSAlandGF¥1, (X,) is either even or odd; 
e IfSAlandG+#1 and GS, there can be no odd (X,,); 


e If S =1 and G #1, then the only possible odd (X,,) is a funda- 
mental Lucas sequence. 


We have already seen that the sequence in the first bullet is the sequence 
(U;,), mentioned in the previous section (/ = 2), whose first four terms are 0, 
1, P, P? — R—3Q. Such a sequence is called standard in [1]; it is shown there 
that it can always be written as the product of two Lucas-like functions. These 
two functions can be expressed as (a’!— 67')/(a1—(1) and (a} — 62) /(a2—82), 
where 


a1 +61 = (A+ B)/(2\/Q), 0161 = Q, a2+fo = (AFB)/(2/Q), a252 = Q, 


and A= /R+4Q+2P,/Q, B= J/R+4Q — 2PV/Q. 

While we emphasize that this has not yet been proved, it appears that 
there are no nondegenerate odd fourth-order LDSs when G = S$ # 1 or when 
S #1 and G =1. Thus, considering the above results, we could only expect 
to find an odd fourth-order LDS (X,,) when G = S = 1. 

If G = 1, it is shown in [38] that P? = rire, R = qre+ geri — 4q1@2, 
Q = qq and E = (qir2 — geri)’, where r1, 41, T2, 2 are integers such that 
ry > 0, ro > 0, ged(ri, qi) = 1, ged(re, qa) = 1 and rire is a perfect square. 
If G = 1 and (X,,) is even, then it is also shown in [38] that 


[Xn] = |On(V71; 21) Un(V72, @2)I- 


From results in Sect. 4.4, it follows that the sequence (X,,) is clearly a divis- 
ibility sequence. Thus, if G = 1 (E is a perfect integral square) and (X,,) is 
even, then (X,,) must be a LDS of order four. 

If G = 1 and S ¥ 1, it is shown in [39] that any odd fourth-order LDS 
(X,,) must have the form: 
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Xn — Val/71 n)Un (72, q2)/V71, 


where (X,,) satisfies the fourth-order linear recurrence 
Xnta = PXn43 — (R+2Q)Xnt2 + QPXn4i — Q?Xn, 


but it is not always the case that this (X,,) will be a divisibility sequence. 
For example, if r1 = ro = 7, q. = 2 and q. = 141002, we see that (X,) 
behaves like a divisibility sequence until we arrive at the 28th term, where 
we discover that X 4 is not a divisor of X2g. However, if we put r = 7, 
d=, T2 = Vs(Vr, 9)? = rV.(r,q)? and q2 = qs, where s is any fixed odd 
positive integer, we get: 


Xn = Vn (r, qUns (, q)/Us (r, q)- 


Here U and V denote the Lehmer sequences. (This X,, is essentially the 
same function as that discovered by Koshkin when f(a) = (a + 1)(a? — 1) 
and « = a/@, where a+ 8 = \/r and af = q.) It can be shown that 
this (X,,) is a fourth-order odd LDS with characteristic polynomial (8.16); 
also, by using some identities which are satisfied by the Lehmer functions 
(Sect.4.4), we have P = rV,(r,q), R = qVs_1(r,q)Vei1(7,¢), Q = g?*, 
A = (r ~ 490, (r,9)?, B = (arr - 49)0.-1(7, )Os41(,9))", and X_ = 


(r — 2q)V3(r, q). Notice that in this case, we have S' = 1. 

In the particular case of s = 3, we have P = r(r —3q), R = q(r —2q)(r? — 
Arq + 2q7), Q = qd and X_ = (r — 2q)(r — 3q), whereas for the standard 
sequence, we would have X29 = P = r(r — 3q). If we put r = 5 and q = 1, we 
get P = 10, R = 21, Q = 1, the standard sequence (La, — L2,)/4 with terms 


0,1, 10, 76, 540, 3751, 25840, 177451, 1217160, 8344876, 
57202750, 392089501, --- , 


and the odd sequence (F4, + Fo,)/4 with terms 
0, 1,6, 38, 252, 1705, 11628, 79547, 544824, 3733234, 25585230, 175356611, --- 


These are listed as A215465 and A215466, respectively, in the OEIS. As noted 
in [37], the terms of the sequence A215465 can be written as LD, F3,/4 when 
n is even and F,,L3,/4 when n is odd. This same sequence was also produced 
by Bala [2], using a different procedure. 

The above sequence with different values for r and q will also yield the 
odd LDSs: A238536 (r = 1, g = —1), A238537 (r = 4, g = —1), and A238538 
(r =9, q = 2). 

If we next select (X,,) here by setting X2 = r(r — 2q), then it turns out 
that this (X,,) is also an odd fourth- order LDS with characteristic polynomial 
(8.16) and, as before, P = r(r — 3q), R = q(r — 2q)(r? — 4rq + 297), Q = dt. 
Here, it can be shown that 
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Xn = Von (1, Un(Vr, 9). 
If we put r = 5, q = 1, we get the sequence: 
0, 1, 15, 128, 945, 6655, 46080, 317057, 2176335, 14925184, 102320625, --- , 


of general term Fy,, — 2F,, which appears as A127595 in the OEIS. 

If we put r1 = t?, @ =4, T2 = (t? — 2g)”, g2 = q? and X2 = 2°, we find 
that A = t?(t? — 4q)? and X,, = U,,(t,q)?. The corresponding sequence (Xp) 
is clearly odd and a fourth-order LDS. For t = 1, gq = —1, we get A056570 in 
the OEIS. 


Most of the above sequences were found, using a different approach, by 
Oosterhout [18]. Also, the Lehmer sequence U is a fourth-order linear divisi- 
bility sequence as is (anU,(R,Q)), where (an) is a special periodic sequence 
of integers such that a; = 1 and a, | azn. Indeed, the a sequence must be 
some V sequence, where Q = +1 and R = Q, 2Q or 3Q. 


We have seen, then, that there are infinitely many odd fourth-order LDSs; 
are there any others apart from those mentioned here? Possibly, the answer 
to this question is yes, but no nontrivial examples are known to the authors, 
none were found during an extensive computer search and none appear in 
the OEIS. For more information concerning fourth-order odd divisibility se- 
quences, see Roettger and Williams [22]. 

When k = 3, we have seen that if Q = R? and P; = RP? — 2RP2 — 4R°, 
the sixth-order linear recurrence 


Xn+6 =P\Xn+5 > (P2 + 3Q)Xn+4 + (P3 + 2QP1)Xn+3 
oa Q(P» + 3Q)Xn+2 + OPP Kai — Ox, 


is satisfied by the divisibility sequence (U,,), where Up = 0, Ui; = 1, U2 
P,+2R, U3 = P? + RP, — P, —3R?. Since the generalized Lehmer sequences 
(Uin) (@ = 1,2,3) are all odd, we see by (8.15) that (U,,) is an odd LDS of 
order six. Thus, we need only know the values of Uz and U3 and (8.17) to 
completely specify the sequence. Particular instances of this sequence occur 
in the OEIS as A180510 with P, = —1, P23 = —5, R=1, Ps = 7 and A005120 
with P, = —3, P27 = 2, R= 1, P3 = 1. Others are: A0Q01351, A001945 and 
A006235. 


Apart from the above, very little seems to be known about LDSs of order 6. 
However, if we have integers Q, L, S such that L? | $?Q and put P; = L+S, 

= LS+T, P3 = LT, where T = L?+QS?/L? — 4Q, then it can be shown 
that the sequence of integers (X;,)n>o0 with 


Xn = Un(L, Q\Un(./(S + 2L)/L, 1) 


is an odd sixth-order LDS which satisfies (8.17). We remark that this sequence 
is of the form given in the first formula of (8.11) with 7, 4 y2 = 73. There 


(8.17) 
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are several examples of these sequences listed in the OEIS. For example, if 
we put L = 6, Q = 1, S = 42, we get A003755; others are A003753, A003767, 
A006235, A006237, and A169630. We also mention Koshkin’s sixth-order LDS 
(X,,) with 


Putting P= 1, Q = —1, we get X, = Ly Lan F3p,/6. 
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Chapter 9 


Prime Density of Companion Lucas 
Sequences 


— Which man first watched the moon? 
— Which moon first lit the face of aman? (Wang-Wei) 


To be a great mathematician one must have the soul of a poet (Karl Weierstrass 
in a letter to Sofia Kovalevskaya) 


It seems to me that the poet has to perceive that which others do not perceive, to 
look deeper than others look. And the mathematician must do the same thing (Sofia 
Kovalevskaya) 


Abstract The central objective reason for this chapter is to show that the 
relative proportion, within the set of all primes, of primes which divide at 
least one term of a companion Lucas sequence can be computed precisely and 
unconditionally. For instance, we have known since Hasse that, on average, 
out of 24 primes, 17 divide some terms of the sequence (2” + 1)n>0. Sec- 
tions 9.1 and 9.2 are preliminary. Section 9.1 makes the word “proportion” 
precise, using density concepts. Section 9.2 mentions some of the rudiments of 
algebraic number theory necessary to state the classical density theorems for 
prime numbers, and, in particular, the celebrated theorem of Cebotarev'. Fa- 
miliarity with abstract algebra, including rings, ideals, and quotient rings, and 
Galois theory is assumed. Section 9.3 applies the Cebotarev density theorem 
to a special class of recurrent sequences, the polynomial sequences (P(n))n, 
where P is a monic polynomial in Z[x]. This section can be skipped without 


” but sometimes 


+ The usual English spelling of this Russian surname is “Chebotarev, 
“Tchebotaref,” and its spelling varies a great deal in Western languages, even though they 
use the same Roman alphabet; thus, we prefer the simpler “Cebotarev,” which at least 


contains the same number of letters as its cyrillic counterpart. 
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any hindrance to reading the rest of the chapter. The early inquiries and 
works that later led Hasse to find a complete answer to the density of primes 
that divide companion Lucas sequences of the type (a”+1),, a a square-free 
integer, are discussed in Sect. 9.4. The method of Hasse, and of Lagarias, who 
expanded the method of Hasse to some special second-order linear recurrences 
with irrational roots is presented in Sect. 9.5. In Sect. 9.6, we look at these 
prime density questions heuristically. The heuristics do not use the density 
theorems of algebraic number theory, save the Dirichlet density theorem, and, 
with simple tools, often permit an intuitive insight into these densities. The 
creative works of Laxton, who discovered a group structure for second-order 
recurrences that share the same characteristic polynomial, a group relevant 
to divisibility by primes, and extensions of this group to higher-order recur- 
rences, are the object of Sect. 9.7. For m-th order linear recurrences, m > 3, 
we say, after Ward, that a prime is a maximal divisor of the sequence, when it 
divides m—1 consecutive terms of the recurrence. Section 9.8 shows how both 
the method of Hasse and Lagarias, and the aforementioned heuristics, apply, 
in some special instances, to the calculation of the prime density of maximal 
divisors of some linear recurrences. The final section reports about a foray 
made into the parallel world of polynomials over finite fields. The density of 
monic irreducible polynomials that divide the polynomial companion Lucas 
sequence (X” + 1), is investigated. 


9.1 How Do We Measure the Size of a Set of Primes? 


Let (Pn)n>1 denote the sequence of rational prime numbers, where primes 
are ordered according to size, i.e., Pn41 is the least prime larger than py. 
The first few terms of the sequence are p, 2, po 3, p3 5, pa 7, 
and ps = 11. We denote the set of all primes {p,; n > 1} by P. As is 
usual, the counting function for the number of primes not exceeding some 
positive real number x is denoted by (x). For instance, (10) = 4. The 
prime number theorem (PNT) conjectured by Gauss in his teen years in 
the late eighteenth century and proved independently and in the same year, 
1896, by Hadamard and de la Vallée Poussin, says that (x) is asymptotically 
equivalent to «/loga as x tends to infinity, where log is the Naperian or 
natural logarithm. The PNT is equivalent to the statement that p, ~ nlogn, 
as n tends to infinity. (Indeed, n = (pn) ~ pr/log(pn) by the PNT. Hence, 
Pn ~ nlog py. Since p, tends to infinity, the logarithms of p,, and of n log py, 
must be asymptotically equivalent. Since log log p,, = o(logp,), we get that 
log pn ~ logn, as n — oo. This yields p, ~ nlogn. For the converse, we 
obtain that pp ~ Pn41, logpn4i ~ logpn ~ logn. Thus, if py < @ < pri, 
then r(x) =n and x/logz ~ pp/log pn ~ nlogn/logn =n.) 


We denote the cardinality of a finite set A, i.e., the number of elements 
in A, either by |A| or #A. If « > 0 and S is a subset of P, then S(x) is 
the set of primes in S$ that do not exceed x. Thus, #S(x) is the number of 
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primes in S(«). Finding out whether a set of primes, S, is finite or infinite 
is often a first step toward measuring the size of S. It can be a very difficult 
problem: to date we have not found a proof of the infinitude of the twin 
prime set in spite of all the heuristic and numeric evidence. But if we know 
S to be infinite, then seeking bounds, or better asymptotics, for the counting 
function #:S(«) is the next step, or else determining whether S possesses a 
density relative to the set P. If w is a function from P to the nonnegative 
real numbers such that >/,-p w(p) diverges, then w is a weight function to 
which we may associate a density d,: a subset S of P is said to possess a 
dw-density, d = d(S) (d € [0,1]), if and only if 


de Ym CPEs) UP) 


There are a few general consequences of this definition. A finite subset 
always has a zero d,,-density. Also, the d,,-densities are finitely additive, i-e., 
a finite disjoint union U?_, A; of subsets of P, each of which possesses a 
dy-density, has a d,-density equal to the sum 57)", d,,(A;). Hence, adding 
or removing finitely many primes from a given set does not alter its dy, 
density. However, d,,-densities are not countably additive since d,(P) = 1, 
P = Usi{pi} and 7,5, du({pi}) = 32;5,0 = 0. These few properties are 
often expected to hold for any density concept whether the density is defined 
as in (9.1) or differently. 

We present next the three most common densities used in relation to sets of 
rational primes, i.e., the natural, the analytic, and the logarithmic densities. 

The natural or asymptotic density of a subset S of P (i.e., the natural 
or asymptotic density of S within the set of primes) is defined in analogy 
with the notion of natural density for subsets of the natural numbers (see 
Sect. 5.5.4). The set S has a natural density d € [0,1] if and only if the limit 


lim #5(2) 


2-00 1 (ax) 


exists and equals d. This corresponds to using the constant weightfunction 
w(p) = 1, p prime, in (9.1). 

Upper and lower asymptotic densities are defined as the upper and lower 
ae as x tends to infinity. Such 
densities exist for all subsets of P. (They are not in general finitely additive.) 
A set has an asymptotic density if and only if its upper and lower asymptotic 
densities coincide. 

The analytic or Dirichlet density of S C P is defined as 


lim Yves 1/P* ue 
soit bags 1/p*’ 


limits of the function, respectively: x HH 


if the limit exists. 
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As is well known, Euler showed the divergence of the series )7,, ao More 
precisely, we have: 


iL 
S- — ~ log log 2, (p € P) (9.3) 


pSu 


as & goes to infinity; see for instance [4, Thm. 4.12]. But if s > 1 is a real 
number and x, = 1/p%, then the series }>,,., 2, converges. Indeed, the 
series )>,,5 Un converges iff, by the PNT, >>,,.3 1/(nlogn)* converges. But 
the latter series is dominated by the convergent Riemann series }>,, 1/n*. 
Thus, the expression (ives 1/9?) / (| ose 1/p*) in (9.2) is well defined for 
alls > 1. 

As turns out [11, Sect. 2], analytic density for a set of primes is equivalent 
to another useful density: the logarithmic density. A subset S' of P possesses 
a logarithmic density 6 = 6(S) if and only if the limit 


: nests) 1/p 
lin ——— 
zoo loglog x 


exists and equals 6. Thus, by (9.3), logarithmic density corresponds to using 
the weight function w(p) = 1/p in (9.1). 

If a set of prime numbers possesses a natural density d, then it must have 
a logarithmic density 6 with 6 = d. The converse is false. There are sets of 
primes with a logarithmic density, but with no natural density. For instance, 
the set D, of prime numbers whose decimal expansion begins with the digit 1, 
ie., Dy = {11,13,17,19, 101, 103, 107, 109, 113, 127,...,199, 1009,...} does 
not have a natural density: the upper and lower asymptotic densities of D, 
are distinct and can be calculated using the PNT. They are 5/9 and 1/9, 
respectively. But the logarithmic density of D, exists. Its value is 6(D,) = 
log,) 2 ~ 0.301. The set of natural numbers with first digit 1 is easily seen 
not to have an asymptotic density within the positive integers and, also, to 
possess upper and lower asymptotic densities of 5/9 and 1/9 as well as a 
logarithmic density equal to log,, 2. General conditions were given [27] on 
subsets of the natural numbers, which include the subset of prime numbers, 
for the natural and logarithmic densities to be preserved when conditioning 
with respect to such subsets. 

More generally, 6(D,) = logi9(1 + 1/k), where D, is the set of prime 
numbers whose first decimal digit is k, 1 < k < 9. (Some readers may rec- 
ognize the exact frequency given by Benford’s law. However, Benford’s law 
stipulates that many lists of numbers obey this frequency with respect to 
asymptotic density.’ In the present case, this frequency only holds with re- 


+ If you have the patience, go to your favorite supermarket (Carrefour, Loblaw Companies, 
Safeway, etc.) and write down the first nonzero digit for the prices of some 400 items. 
You should get fairly close to log,j(1 + 1/k) for the proportion of digit k. Note that 
ee logy9(1 + 1/k) = 1. Or make a list of all rivers in Russia, or in Canada, and mark 
the first digit of the lengths of these rivers and again the frequency of digit k should get 
near log;9(1+1/k), no matter whether length is measured in miles or kilometers. We refer 
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spect to logarithmic density; one says that prime numbers obey the weak 
Benford law.) 


There is an ordering on densities. We say that d, < dy if and only if, for 
all subsets S of P, the existence of d2(S) implies the existence of d;(5') and 
d,(S') = d2(S). For example, from what we said above, 6 < d. Two densities 
d, and dz are equivalent if and only if d, X dz and dz ~ d,. This equivalence 
relation is denoted by ~. Thus, 6 ~ da, if da is the analytic density defined 
in (9.2). If dy < dg and d, & dz, then we write d, ~ dy. Murata [49, 50] 
showed that the densities dg := dy,,, where the weight functions fy, 
a> 0, 6 > 0 are defined by 


fa,a(@) = 2° "(log x)”, 


give rise to exactly three distinct equivalence classes of densities. The natural 
density d = d1,9 is equivalent to dy,g for all a > 0 and 8 > 0. The logarithmic 
density 6 = doo stands alone in its equivalence class. The third class contains 
the density do; and comprises all remaining densities dy g with a = 0 and 
B > 0. Murata [50] showed that the ordering 


6X do,1 ~< dio, 


holds. 


9.2 The Classic Density Theorems 


There are at least four main density theorems of historic importance: the 
Dirichlet, the Kronecker-Frobenius, the Frobenius , and the Cebotarev den- 
sity theorems. The Cebotarev theorem actually subsumes the other theorems. 
Kronecker and Frobenius discovered their theorems in the early 1880s. We 
make mention of the Dirichlet theorem first. For the sake of brevity, we will 
not state the Frobenius theorem. The Cebotarev theorem, proved in 1922 and 
published in 1925 [73], had been an early conjecture of Frobenius. In fact, 
the Frobenius theorem, although weaker than Cebotarev’s, often suffices in 
many applications. The statements of the two theorems are close relatives; 
one obtains Cebotarev’s from the statement of the Frobenius theorem on 
replacing division classes by conjugacy classes, where division classes come 
from a slightly weaker relation than conjugacy for group elements. Cebotarev 
[68] claimed to have found a proof of his theorem while carrying water uphill 
and buckets of cabbage downtown, not in the cozyness of an office. 


the reader interested in Benford’s law to the book [19], or the survey paper [18] by the same 
two researchers, and to [36, Chap. 1] for an excellent intuitive general-public presentation 
of the Benford phenomenon. 
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If a, b are coprime positive integers, Dirichlet proved in 1837 that the set 
of primes of the form ax + b, where = is an integer, is infinite. In fact, it has 
Dirichlet density 6 = 1/y(a). With the analytic methods that accompanied 
the proofs of the PNT in 1896, it was possible to show this set has natural 
density d (= 1/y(a)).'The only arithmetic progressions az +b, with common 
difference a, which contain infinitely many primes require gcd(a, b) = 1. Thus, 
we see that primes are equidistributed among the y(a) distinct progressions 
{ax + b}, 1 < b <a, gced(a,b) = 1, at least in density terms. See [29] for a 
more precise discussion in particular when a = 4 and 0 is either 1 or 3. Thus, 
the statement of the Dirichlet theorem for primes in arithmetic progressions 
often includes the density result, i.e., if gced(a, b) = 1, then 


d(ta,b) = 5(ta,b) = 1/p(a), (9.4) 
where 7,» is the set of primes of the form az + b. 


Remark 41 The density theorems of Dirichlet, Kronecker, Frobenius, and Ce- 
botarev were first established in relation to analytic density. It was later 
shown any set of primes which possesses a density from one of these the- 
orems must have a natural density as well. See the remarks of Serre [61, 
Chap. VI], and for proofs [55, Chap. 5] and [51, Theorem 7.10*], for instance. 


We note in passing that the set of primes with first decimal digit 1 cannot 
be described using the Cebotarev theorem. As mentioned in Sect. 9.1, this 
set has logarithmic (hence, analytic density), but no natural density. 


We state a few basic facts about number fields which are part of introduc- 
tory books on algebraic number theory; see, e.g., {1, 40, 53]. 


A number field is a subfield of the field C of complex numbers which, as 
a vector space over the field of rational numbers Q, is of finite dimension. 
Therefore, all numbers in a number field are algebraic: they are the zeros of 
some polynomial over Z. Those that are zeros of monic polynomials over Z 
are called algebraic integers. Algebraic integers in a number field K form a 
ring Ox. Thus, Og = Z. Also, for example, Q(V2), the field extension of Q 
generated by V2, is {r + sV2; r,s € Q}. That is, {1, V2} is a finite basis of 
Q(V2) over Q. Its ring of integers is {r + s/2; r,s € Z}. However, the ring 
of integers of Q(/5) is {r + sa; r,s € Z}, where a is (1 + V5)/2. The ring 
of algebraic integers Ox of a number field K has the property that every 
prime ideal is a maximal ideal. Moreover, unique factorization of rational 
integers into their prime factors has a generalization in a ring Ox: every 
ideal factorizes uniquely into prime ideals. Rings of integers, unlike Z, are 


+ The Dirichlet density theorem stated in the form 


1 * dt 
p<; p=b (mod a)}|= =f HO. (ae~eV BB), 
p(a) Jo logt 


for some positive c independent of a, is often called the PNT for arithmetic progressions. 
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not in general principal ideal domains. But any ideal J in Ox may always be 
generated by two elements, i.e., there exist a and bin Ox such that I = (a,b). 
A prime ideal P in Ox contains a (unique) rational prime p. The finite field 
Ox/P is a field extension of Z, := Z/(p). Its degree f = f(P|p) is called the 
inertial degree of P over p. Thus, (Ox /P)/Z, is isomorphic to the finite Galois 
field GF(p/). The corresponding Galois group of the finite field extension is 
cyclic of order f generated by the Frobenius map op: x ++ «”. Suppose L/K is 
a number field extension, i.e., L is a number field containing kK. Then, there 
exists an algebraic number a, called a primitive element of L/K, such that 
L = K(a), where K(a) is the smallest number field containing K and a. In 
fact, a can be chosen to be an algebraic integer. The ideal (p) generated by a 
rational prime in the ring Ox of a number field K need not remain a prime 
ideal. Say 

(p) = Pf PS «Pe, (9.5) 


is the prime ideal factorization of (p) in Ox, i-e., the P;’s are prime ideals in 
Ox and the e;’s are positive integers. Then, we say that p splits into r primes 
in K and e; is the ramification index of P; over p. A prime p is said to be 
ramified in K if at least one of the e;’s is greater than 1. Since only finitely 
many primes are ramified in a given number field K, we state our theorems 
in the unramified case for, as, was remarked in Sect. 9.1, removing finitely 
many primes does not alter densities. Suppose the degree [K : Ql, i-e., the 
dimension of K as a vector space over Q, is n. If f; is the inertial degree of 
P, over p, i =1,...,7r, in (9.5), then we must have: 


r 


n= Seif, (9.6) 


i=l 


or, in the unramified case, }*;_, f; =n. We say the prime p splits completely 
in K whenever all inertial degrees f; are 1, i.e., when r = n. The prime p is 
inert if the ideal (p) remains prime in Ox, i.e., ifr = 1 and f = f((p)|p) =n. 


Suppose L/K is a Galois number field extension, i.e., all field embeddings 
of L in C which fix kK pointwise are automorphisms of L. Put G = Gal(L/K). 
Suppose P is a prime ideal in Ox, unramified in L, and Q is a prime ideal in 
Oy, above P, i.e., such that Q@ contains P, or equivalently, such that Q is a 
prime ideal factor of PO;,. Note that the extension (O;/Q)/(Ox/P) may be 
seen as a finite field extension.’ The group G acts transitively on the prime 
ideal factors of the ideal PO,. The decomposition group D = D(Q|P) is the 
stabilizer of @ in this action, i.e., for 0 € G, we have o € D if and only 
if o(Q) = Q. If o € D, then o induces an automorphism of the finite field 
extension (O;/Q)/(Ox/P). In fact, D is isomorphic to the cyclic Galois 
group D of this finite field extension. (The group D has for generator the 
(generalized) Frobenius map op: «+> a!”!, where |P| is pf(?l), p being the 


+ Since P = QN Og, P is the kernel of the map: Ox% ++ O,/Q which sends x to « + Q; 
thus, Ox /P is injected into O,/Q which contains an isomorphic image of Ox /P. 
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rational prime under P.) The corresponding generator = ©(Q|P) of the 
group D is called the Frobenius automorphism relative to P and Q. It has 
the defining property that for all z € Or, 


&(x) = 2'P! (mod Q), (9.7) 


i.e., ® is the only element of G with property (9.7). We readily see that if 
o € G, then &(0(Q)|P) = oGo0~'. Thus, in a normal extension, the prime 
ideals above P all have the same inertial degree f, and the degree of the 
Galois extension is rf, if P splits into r primes. Note that as we shift from 
one prime ideal above P to another, the Frobenius automorphism visits a 
whole conjugacy class, Cp = Cp(L/K), of G. This conjugacy class is usually 
denoted by the Artin symbol 
Fr] 


P 


We are ready to state the Cebotarev theorem. We do so restrictively by as- 
suming the bottom field is Q since we are interested in the natural density 
of rational primes rather than the Dirichlet density of prime ideals in a gen- 
eral number ring Ox. The theorem asserts that, as we go through primes, 
the frequency with which the Frobenius automorphism & belongs to a given 
conjugacy class C' of the Galois group G is proportional to the size of C, i.e., 
is equal to the ratio |C|/|G]. 


Theorem 127 (Cebotarev’s density theorem) Let K be a Galois number 
field extension of the rationals with Galois group G. Let C be a conjugacy class 
of G. Then, the set of unramified primes p in K for which C,(K/Q) is C 
has a Dirichlet and a natural density equal to |C|/|G|. 


We state the Kronecker-Frobenius density theorem, an immediate and 
useful corollary to the Cebotarev theorem. 


Theorem 128 (The Kronecker-Frobenius density theorem) Let K be 
a Galois number field extension of the rationals with Galois group G. Then, 
the set of rational primes which split completely in K has a natural density 
equal to 1/|G|. 


Proof If a rational prime p splits completely in Ox, then the inertial degree 
of a prime P in K above p is 1. Thus, the Frobenius automorphism &(P|p) 
has order 1 in G. The identity of the Galois group forms a conjugacy class 
all by itself. The result follows from the Cebotarev theorem. 


We state another corollary for cyclotomic extensions. It shows how the 
Dirichlet density theorem follows from the Cebotarev density theorem. 


Corollary 129 Suppose K is the cyclotomic Galois extension Q(¢), where 
¢ =e?™'/" n> 1 an integer. Then, primes of the form k+ nz, (x > 0), where 
1<k <n, ged(k,n) = 1, have Frobenius automorphism defined by ¢ + ¢*. 
Their density is 1/y(n). 


9.2 The Classic Density Theorems 227 


Proof The Galois group G of Q(¢) is the set of automorphisms of K defined 
by Te: C+ C*, for 1 < k < nand gcd(k,n) = 1. Thus, G is isomorphic to the 
multiplicative group of integer modulo n, and |G| = y(n). The automorphism 
that sends ¢ to ¢?, i.e., Tp, satisfies the property (9.7). Therefore, 7, is the 
Frobenius automorphism ®(P |p), (P in Ox above p). Since ¢* = CP if and 
only if p = k (mod n), we see that ®(P|p) = 7, if and only if p has the 
shape k + nx. Since G is abelian, {7} is a conjugacy class of G. Thus, by 
Cebotarev theorem, primes of the form k + nx have density 1/y(n). 


Remark 42 Note that the order f of &(P|p), P a prime ideal in Ox, is equal 
to the multiplicative order of p modulo n. Hence, primes in the progression 
k+nz split into r= y(n)/f prime ideals in Ox. In particular, primes of the 
form 1+ nz split completely in K, while those of the form —1+ nz, (n > 2), 
split into y(n)/2 prime ideals in Ox. 


Although not a density theorem, the Kummer-Dedekind theorem is often 
useful in our context. It points to a complete analogy between two factor- 
izations: the factorization of an irreducible polynomial modulo a prime ideal 
and the factorization of this prime ideal in an extension containing a zero of 
the polynomial. See, e.g., [40, Theorem 27] for a proof. 


Theorem 130 (The Kummer-Dedekind theorem) Suppose L/K is an 
extension of number fields and a is an algebraic integer such that L = K(a). 
Let m= m,(«) denote the minimal polynomial of a over Ox. Then, for all 
but finitely many prime ideals P in Ox, the factorization of m (mod P) has 
the shape: 


m=M1M2°++Mp (mod P) — > POr = Q1Q2::-Q,, 
where Q; is the prime ideal PO, + (m;(a@)) in Ox lying above P. 


Thus, if K = Q and L = Q(i), (i = V—1), then odd rational primes 
p which split into two ideals in Oy are the primes 1 (mod 4). Indeed, the 
minimal polynomial of i, m;(x) = x? +1, factorizes modulo p into two linear 
factors if and only if —1 is a square modulo 9, i.e., if p= 1 (mod 4). Primes 
of the form 3 + 4k are inert primes. Say p = 5. Then, 2? +1 = 2? —4 = 
(a — 2)(a + 2) (mod p). By Theorem 130, the two prime ideals above 5 in 
Ox are P, = (i— 2,5) and Py = (4+ 2,5). Since 5 = (2 —1)(2 +7), 5 belongs 
to the ideals (2 — 7) and (2+ 7). Hence, P; = (2—%) and P2 = (2 +7). More 
generally, for all primes p= 1 (mod 4), say p = u? + v”, the two prime ideals 
above p are the principal ideals (wu — iv) and (u+iv). Here, Oy is the ring of 
Gaussian integers Z|7], which is well known to be a principal ideal domain. 


228 9 Density of Companion Sequences 


9.3 Polynomial Modulo p and Cebotarev 


The Galois group of a polynomial f € Z[z] is the Galois group of L/Q, where 
L is the splitting field of f. That is, Z is the number field extension of Q 
generated by the zeros of f. If f is irreducible of degree n > 1, then G may 
be viewed as a transitive subgroup of the symmetric group S,,. Indeed, G acts 
transitively on the zeros a1, Q2,...,Qp of f. In fact, in some sense, for almost 
all degree-n irreducible polynomials over Z, G is isomorphic to $,, [28]. 


Let us assume, for the sake of simplicity, that a is an algebraic integer of 
degree n, i.e., its monic minimal polynomial m = m,(x) over Z has degree 
n. Denote Q(a) by K and the splitting field of ma(x) by L. Let G be the 
Galois group of the normal extension L/Q and H the subgroup of G of 
automorphisms that fix K pointwise, or equivalently that fix a. Thus, H is 
the Galois group of L/K, and the index of H in G, [G : H], is equal to [Q(a) : 
Q| = n by Galois theory. The Kummer-Dedekind theorem (Theorem 130) 
establishes a one-to-one correspondence between the factorization of m into 
r irreducible factors modulo p and the splitting type of the ideal (p) in Ox 
into r prime ideals, where degrees of the factors of m match the inertial 
degrees of corresponding prime ideals. This same common factorization can 
be expressed in yet another way: the action of the Frobenius automorphism 
& = &(Q|p), where Q is a prime ideal above p in Oz, it does not matter 
which such Q, on the complex zeros of ma(x), yields a permutation in S,, 
whose cycle-type decomposition matches the factorization of m modulo p. 
More precisely, we have, using the above notation, the following theorem. 


Theorem 131 For all but finitely many primes p, except primes ramified in 
L or divisors of the discriminant of ma(x), Ma(x) splits into r irreducible 
factors of degrees f; < fo <---< f, in F,[a] if and only if the decomposition 
of ®(Q |p) into disjoint cycles as a permutation of the complex zeros of Mq(x) 
contains r cycles of respective lengths fi, fo,...,fr, where Q is any prime 
ideal of Oz above p. 


Proof The action of & = &(Q|p) on the left cosets oH of H in G by left 
multiplication produces r orbits whose lengths match the inertial degrees of 
the prime ideals P above p in Ox [40, Thm. 33]. The action is given by 
®-oH = OcH. The orbits cH, cH, €?cH,...,6°~1oH are in one-to-one 
correspondence with the prime ideals P = o(Q)™ Ox which come into the 
factorization of pOx. Moreover, ¢ = f(P |p). To each such orbit corresponds 
a cycle in the action of ® on the complex zeros of mg(x), namely, the cycle 
(a(a), &(a(a)), B2(a(a)),..., 1 (a(a))) with &*(a(a)) = o(a). 


The Cebotarev density theorem and Theorem 131 tell us how often a 
polynomial factorizes in a prescribed manner modulo primes. We indicate 
this in the form of an immediate corollary. 
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Corollary 132 Let g(x) be a monic irreducible polynomial over Z with Ga- 
lois group G. Suppose 7 in G decomposes, as a permutation of the complex 
zeros of g(x), into r cycles of lengths fi, fo,..., f-. Then, the set of rational 
primes such that g(a) factorizes into r irreducible factors in F,[z] of degrees 
fi, fo,..-, fr has a natural density equal to |C|/|G|, where C is the conjugacy 
class of 7 in G. 


Thus, the factorization of a polynomial modulo primes can be used to 
yield information on the Galois group of the polynomial, and determine its 
structure; see [75, Sect. 8.10]. 


Before examining a few density-related consequences on the factorization 
of polynomials modulo primes, we provide a couple of numerical tables which 
illustrate Corollary 132. 

The polynomial f(x) = 24 — 4a? — x + 1 in Z[z] has a Galois group G 
isomorphic to $4. Therefore, all permutations of its four zeros occur. The first 
line of Table 9.1 indicates the five cycle-type decompositions of permutations 
in S4. The second line gives the density of rational primes p for which the 
Frobenius map &(P |p) lands in the various conjugacy classes of G. Here, a 
conjugacy class is characterized by a cycle-type decomposition, and P is a 
prime above p in O;, L being the splitting field of f(x). The next 3 lines 
indicate, for 3 lists of 24 consecutive primes p, how many achieve a given 
factorization type for f(x) (mod p). Even on such a small sample of primes, 
we see a fair adequation of the number of primes such that f(a) factorizes 
into a given type, say, for example, two linear factors and a quadratic, and the 
conjugacy class size of transpositions in S4 which has type [1, 1, 2]. Line 5 has 
the number of expected primes if the density proportion were to be respected 
for each of lines 2, 3, and 4. The discriminant of f(a) is 19-103. Thus, f has 
multiple zeros modulo the primes 19 and 103. More precisely, the factorization 
of f is of type [17,2] in both cases. They were assimilated as belonging to 
the type [1, 1, 2]. (Note that, among the first 72 primes, f(x) factorizes into 4 
linear factors only for pg9 = 347, namely, (a —117)(a—140)(a+106)(#+151) 
(mod 347).) 


Table 9.1 Factorization of «* — 4%? — x +1 (mod p) 


Cycle types [1,1,1, 1 [1, 1, 2] [2, 2] [1, 3] [4] 
Densities 1/24 1/4 1/8 1/3 1/4 
Pl,+++;P24 0 5 1 11 7 
P25,-++,P48 0 6 3 10 5 
PA9,-++,P72 1 5 4 8 6 
‘Expected numbers’ 1 6 3 8 6 
Pl,--+,P72 1 16 8 29 18 
‘Expected numbers’ 3 18 9 24 18 
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Let us consider the polynomial g(x) = «+ — 2. The decomposition field 
of g is Q(W2,%), where i = /—1. Put a = ¥2. The four zeros of g are a, 
ai, —a, and —ai. Let us number them as 1, 2, 3, and 4, respectively. The 
Galois group is of order 8, and because a member of G has to send 7 to +7 
and a to ta or tai, one easily verifies that G is isomorphic to the dihedral 
group generated by complex conjugation 7 = (24) and o = (1234). Thus, G 
contains the identity, two transpositions (13) and 7, the three permutations 
of type [2,2], and two 4-cycles o and o~'. Since the discriminant of g is a 
power of 2, we tested the factorization of g modulo all 24 odd primes less 
than 100 and recorded the data in the table below. 


Table 9.2 Factorization of a* — 2 (mod p) 


Cycle types [1,1,1, 1] [1, 1, 2] [2, 2] [4] 

Densities 1/8 1/4 3/8 1/4 
P2,+++,P25 2 6 10 6 
“Expected numbers” 3 6 9 6 


We turn our attention to the following question. How many zeros should 
we expect an irreducible polynomial over the integers to possess in Z,[z], 
where p is a prime? Say f(x) is an irreducible polynomial of positive degree 
n over Z. If 0 <i <n, then the set of primes p such that f(a) has exactly 
i zeros modulo p has a density, say d;, as we will see. We have the obvious 
relationship pean d; = 1. Furthermore, the average number of zeros of f 
modulo p as p varies through the primes is 1. 


More formally, we have the following. 


Theorem 133 Let f(x) be an irreducible polynomial over Z of degree n > 1. 
Let d;, 0 < i < n, denote the density of the set of primes p for which f 
modulo p has exactly i zeros. Then, 


n 


yo = 1. 


i=1 


Theorem 133 was first stated by Kronecker [34], but a complete proof that 
included the existence of the densities dj; came with Frobenius. 

For instance, if f(7) = 2? +1, then d, = 0 since f cannot have a zero 
without having two. Thus, 2d; = 1. Therefore, d) = dp = 1/2. That is, 
primes congruent to 1 (mod 4) have the same density as those —1 (mod 4), 
something we already knew from the Dirichlet theorem (9.4). Say f is a 
cubic polynomial with alternating Galois group Ag. Only the identity of A3 
has some fixed point. It fixes the three zeros. By the Cebotarev theorem, 
the density of primes for which the Frobenius automorphism is the identity 
is 1/3. Hence, we see that dj = 1/3, dj = d2 = 0 and do = 2/3. Thus, we 
have checked that Theorem 133 holds in such a case. If the Galois group is 
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S3, then by the Cebotarev density theorem d3 = 1/6, d, = 1/2, since S3 
contains three transpositions, and dy = 1/3 (there are two three-cycles, or 
dp + d; + d3 = 1). We have again verified that 3d3 + d, = 1. 

Theorem 133 can easily be extended further. Suppose f is a nonzero dis- 
criminant polynomial over Z. Then, f has N irreducible factors in Q[z] if and 


only if 
re 
i=1 


Remark 48 Because of the rapidity with which counts of primes experimen- 
tally settle down toward the actual densities, this property could possibly be 
used to determine N. 


We establish a couple of lemmas before proving Theorem 133. The first 
lemma was proved by Frobenius in 1887, but its genesis originated in a work 
of Cauchy in 1845 according to Rotman [58, Thm. 3.26]. 


Lemma 34 Suppose G is a finite group acting on a finite set X. Then, 


1 


gEG 


where N is the number of orbits in the action and F'(g) is the number of 
elements in X that are fixed by g. 


Proof If x and y are in the same orbit O, their respective stabilizers S, and 
Sy in G have the same cardinality, say So. Indeed, if o(x) = y for some 
o € G, then S, = o~'S,o. The cosets of S; are naturally in one-to-one 
correspondence with the elements of O. Thus, |G| = |Sz|-[G: Sz] = So-|O|. 
Therefore, 


Pos >> yo leeeeel= 5) lee =e] 


gEG geEG rex xEX gEG 
=DiiSel= D7) dNSel= D7 [Ol So 
cEX orbits O EO orbits O 
= D0 IGl=N-14L, 
orbits O 


where we have used the Iverson symbol [—]. 


Lemma 35 Let f be an irreducible polynomial over Z with n complex zeros, 
(n > 1), and Galois group G. Let Z;, 1 =0,...,n, be the set of elements of 
G which fix exactly i zeros of f. Then, the primes p for which f has i zeros 
in its factorization modulo p have a natural density d; = |Z;|/|G]. 
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Proof Viewing G as a transitive subgroup of the symmetric group S,,, we 
see that Z; is a union of conjugacy classes, since two elements in the same 
conjugacy class must have the same cycle-type decomposition and, in par- 
ticular, the same number of fixed points. Hence, by Corollary 132, the set of 
primes p for which the factorization of f modulo p has 7 linear factors has 
density |Z;|/|G]. 


Proof of Theorem 133. By Lemma 34, we find that 


IG|= 0 FO) =d5 YO FG) 


gEG i=0 geZ; 
n n 

= y y i=) 4Z,| 
i=0 geZ; i=0 


Hence, by Lemma 35, 


Theorem 134 Suppose f(x) is an irreducible monic polynomial over Z. Let 
a be a zero of f(x), K = Q(a) and L be the splitting field of f(x) in C. Let 
G be the Galois group of L/Q and H be the subgroup of the automorphisms 
of G which fix K pointwise. Then, the density d of primes p for which f(x) 
has at least one linear factor modulo p is given by 


d= | UseG o'Ho| 
IG| 


Proof If ¢ € G, then cHo~! is the subgroup of G which fixes o(a). Since G 
permutes transitively the zeros of f(z), the set U,<goHo™! consists of all the 
elements of G which have at least one fixed point. Noting that U,cegoHo + 
is a union of conjugacy classes, namely, of the conjugacy classes containing an 
element of H, we see, by Corollary 132, that |Usego !Ho|/|G| is the density 
of primes for which f(a) has at least one linear factor in its factorisation 
modulo p. 


It is well known that the proportion of permutations with no fixed points 
in S,, approaches e~! as n tends to infinity. This gives the following corollary. 


Corollary 135 Suppose f(x) is an irreducible monic polynomial over Z of 
degree n with Galois group isomorphic to S,,. Then, the set of primes p such 
that p| f(x), for some integer k, possesses the prime density: 


which tends to 1 — 4 ~ 0.6321 as n increases. 
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Proof If G is isomorphic to S,,, then U;ego!Ho = U1 S;,(i), where S;,(i) 
are the permutations of {1,2,...,} which fix 7, (1 < i <n). By the principle 
of inclusion-exclusion, we get: 


So ISn@l-—  S2 [Sn Sn(Z)| +> 


| Ui Sn(%)| 


1l<i<n 1<i<j<n 

ee nm = nm i. _4)n-2 n 

= (7)isnal-($)is-al +--+ yr2(,", isi 
+(-11 


Hence, dividing by n!, we obtain: 


ta (=r 
ae ne rl 
= 1.1, 2% @ (-1}" 
~ oO! ot! = 2! 3) aa n! 
7 n (-1)* 
ee kl’ 
k=0 


from which the result follows. 


Examples. Suppose f(x) is the polynomial «+ — 4x? — x + 1 with Galois 
group S4 of Table 9.1. By Corollary 135, 


1 1 1 5 
dy =1 (1 1+35 at ay) = p= 0.0. 

The density dy is also the sum of the densities associated with each of the 
decomposition types [1, 1,1, 1], [1, 1,2], and [1,3], ie., with those types which 
have at least one fixed point. By Table 9.1, this is the sum 4 + $+ S. 
If g(x) is the polynomial «* — 2 of Table 9.2 with dihedral Galois group G, 
then H = (r), where, recycling the notation adopted for the zeros of g(x) 
above Table 9.2, 7 = (24), i.e., 7 is complex conjugation. The only conjugate 
of T in G is (13). Thus, U,ego+Ho is made up of the identity and the two 
transpositions (24) and (13) giving a density d, = 3/8. 


Corollary 136 Suppose f(x) is an irreducible monic polynomial over Z of 
degree n. Then, the set of primes p for which p{ f(t) for all integers t has a 


density do which satisfies 
n—-1 
do = 


n! 


Proof The condition p { f(t) for all integers ¢ is equivalent to f (mod p) 
having no zeros. Thus, this set has a density, denoted by do in Theorem 133, 
by Lemma 35. Moreover, using Theorem 133 and its notation 
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i=l i=0 
do ees 1)d; > (n—1)dy > — me 


since, by Theorem 128, dn = 1/|G| > 1/n!, where G is the Galois group of f. 


As a consequence if f is of degree n > 2, primes which do not divide any 
f(t), (¢ an integer), have a positive density. Thus, if f has a zero modulo all 
but finitely many primes, f must be a linear polynomial. We point out that 
no elementary proof is known of the fact that infinitely many primes do not 
divide f(t), for any t, when f is irreducible of degree > 2 [47]. A contrario, 
there are several elementary proofs of the existence of infinitely many primes 
dividing a nonconstant polynomial in Z[z] (see, e.g., [40, Ex. 30, p. 91], [47]); 
several proofs are based on the rate of growth of polynomial sequences and 
the fact they take the same integer value at most d times, where d is the 
degree. 


9.4 Initial Inquiries of Sierpinski, Brauer, and Ward 


Given a sequence of rational integers X = (Xp)n>o0 and a prime number p, we 
say that p divides X, or p is a divisor of X, and write p | X, to say that pis a 
factor of at least one term of X. There are relatively few sequences for which 
we know how to prove unconditionally, i.e., without additional conjectures 
such as Riemann hypotheses, the existence of a natural density for the set 
P(X) of prime divisors of X, and simultaneously compute its value. That is, 
if we discard trivial instances where that density is 0 or 1, e.g., a nonzero 
geometric sequence has a 0 density, while a sequence, periodic modulo primes, 
with a zero term, such as the Fibonacci sequence, has a 1 density. However, 
as seen in Sect. 9.3, and in Theorem 134, if X is a polynomial sequence, e.g., 
Xn =n? +1, n4 — 2n? —n+1 or n4 — 2, then we can find the exact prime 
density of X, i.e., of the set P(X), provided we are able to find the associated 
Galois group. 

Even if early inquiries did not seek answers in density terms, estimating 
the size of the set of prime divisors of a companion Lucas sequence began with 
two independant sources, one in Europe with a paper of Sierpinski of 1958 
followed by a 1960 paper of Brauer having to do with the sequence (2” + 1) 
and sequences (a +1), a a positive integer; the other source originated from 
California with a paper of Ward of 1961 dealing with prime divisors of the 
sequence of Lucas numbers L = (L,). 
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Sierpiiski [65] considered the partition of the set of odd primes into two 
classes, the class A of primes that divide a number of the form 2" +1 and the 
class B of primes dividing a number of the form 2?”*!—1. He observed that all 
primes +3 (mod 8) belong to A. Indeed, 2 is a quadratic nonresidue modulo 


such a prime p. Hence, by Euler’s criterion, p divides 2° +1. On the other 
hand, primes p = 7 (mod 8) all belong to B. The integer 2 is then a quadratic 
residue modulo p. Hence, p | 2° — 1. But (p — 1)/2 is odd. Thus, p € B. In 
the remaining class, i.e., primes 1 (mod 8), it was noted that infinitely many 
were in A and that some belonged to B. All Fermat numbers 2?” +1, m > 2, 
are pairwise coprime. Thus, if p | 2?” +1, then 2”+*! must be the order of 2 
modulo p. Since 2"! | p—1 for m > 2, we see that p = 1 (mod 8). Hence, 
as all Fermat numbers are coprime to one another, there are infinitely many 
primes dividing the sequence of Fermat numbers (2?” +1)m>2, and they must 
all lie in A. Brauer [22] noted that for a > 2 an integer, separating primes 
according to whether they divide a number a” + 1, or a number a?”+! — 1, 
comes down to partitioning according to the parity of the order of a modulo 
p. Thus, for a = 2, A is the set of primes with respect to which the order 
of 2 is even, B for which it is odd. He also pointed out that for primes 1 
(mod 8), if 2°||p—1, then p € B if and only if 2 is a 2°-th power residue of p. 
Thus, the idea of splitting primes according to the exact power of 2 in p—1, 
which is essential to Hasse’s method, began to emerge. Brauer also showed 
that infinitely primes 1 (mod 8) are in B. Indeed, for all primes 9 (mod 16) 
representable by the quadratic form 65a? + 256xry + 256y?, 2 is an octavic 
residue. He used the knowledge that if one prime of a given linear form is 
representable by a positive properly primitive quadratic form, then infinitely 
many primes of that linear form are representable by the quadratic form. 
Here, 73 = 9 (mod 16) and, for x = 3, y = —1, 73 = 65a? + 256ry + 256y?. 
In terms of prime density, these findings only gave that the lower density of 
A is at least 1/2, and its upper density at most 3/4. Possibly, the two sets 
A and B could still each be of density 1/2. Hasse’s method [30], presented 
in the next section, will show A to have a natural density of 17/24, closer to 
3/4 than to 1/2. 


Given a second-order integral linear recurring sequence and an integer m, 
Ward [79] posed the important arithmetical question of deciding whether m 
is a divisor of the sequence. He argued that the problem can be reduced to 
finding out the prime divisors of the sequence. Thus, he dedicated his paper 
to a detailed investigation of the prime divisors of a particular sequence, the 
sequence L = (L,,) of Lucas numbers — he calls them Fibonacci numbers of 
the second kind which explains the title of the paper.’ By (2.8), we have: 


LD? —5F? = 4(-1)". (9.8) 


+ See our footnote on the appearance of the terminology “Lucas numbers” for the numbers 
of the sequence L, Chap. 1, p. 2. 
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Ward draws some interesting observations from (9.8). If a prime p = 1 
(mod 4) divides LZ, then 5 must be a quadratic residue of p. By quadratic 
reciprocity, p = +1 (mod 5). By contraposition, primes 1 (mod 4) and +2 
(mod 5) do not divide L. Now suppose p = —1 (mod 4). Put n = p, the rank 
of p in the Fibonacci sequence, in (9.8). Then L? = 4(—1)? (mod p). Since 
—1 is a quadratic nonresidue of p, p must be even. By Theorem 19, p is a 
prime divisor of (L,,). Alternatively, this result can be seen using Lemma 16, 
since (—1|p) = —-1 => p| Lg@-ny2- 


This information is summarized in a visual manner in Table 9.3. 


Table 9.3 Elementary consequences of Equation (9.8) 
p =1 (mod 4) p=-—1 (mod 4) 


p=x1 (mod 5) ? p|L 


p=+2 (mod 5) pt{L p|L 


Consequently, the lower and upper asymptotic densities of the set P(L) 
satisfy 


1 = 3 
5 <d(L) <d(L)< E 


It remained to study primes of the forms 20n +1 and 20n +9, for which both 
—1 and 5 are quadratic residues. For these primes, we have k := v2(p—1) > 2. 
Setting g,(z) =x?" ' — 2° * — 1, Ward proves [79, Thm. 3.2] that p+ L if 
and only if g,(x) splits completely into linear factors modulo p and gx(x?) 
has no linear factors. This criterion is equivalent to how the Hasse-Lagarias 
method proceeds in order to find out the density of non-divisors of ZL. Had 
Ward been inclined to think in terms of density, he surely could have done 
so. Instead he found out criteria to determine whether primes of the forms 
20n + 1 or 20n + 9 divide L. Such primes have a unique representation by 
the quadratic form u? + 4v?, and, as easily checked, 5 must divide uv. Thus, 
Ward proves, for instance, that 


p|L = worv=+l (mod 5). 


We point out the existence of an efficient algorithm of Hermite, as improved 
by Brillhart [23], to find u and v. By Ward’s criterion, we readily see that 
101 = 17+ 4-5? divides L, whereas 109 = 3? + 4-5? does not. Similarly, 
401 | L, but 409 + L. More generally, all primes of the forms 1 + (5x)? and 
4+ (5a)? divide L, while all primes of form 9 + (5x)? do not. 


The antecedents of the Hasse-Lagarias method are detectable both in 
Brauer’s and Ward’s papers. 
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9.5 The Hasse-Lagarias Method 


Companion Lucas sequences V = V(P,Q) are among the rare arithmetic 
sequences which, in the nondegenerate case, have been proven to possess a 
nontrivial natural density of prime divisors, i.e., a density d = d(V) belonging 
to the open interval (0, 1). Moreover, this density is a rational number, and, in 
principle, is computable. For instance, d(V(4,3)) = 2/3, d(V(8,2)) = 17/24 
and d(L) = 2/3. The first two sequences are (3” + 1) and (2” 4 1). In 1966, 
Hasse [30] developed a finely crafted method to evaluate the prime density of 
sequences (a” + 1), for a not + a square. In particular, it fully answered the 
Sierpinski question. With the Hasse method, it is relatively easy to classify 
all densities of companion sequences (a" +b”), when a and 0 are integers; see 
[80], [5, Thm. 3.1.3]. In 1985, Lagarias [35] showed the Hasse method could 
be extended to computing the prime density of certain second-order linear 
recurrences with irrational roots that include companion Lucas sequences.' 
In particular, he illustrated the method with the calculation of the prime 
density of the sequence L of Lucas numbers, thereby completing, at least in 
some sense, the work of Ward [79]. This explains the title of this section: The 
Hasse-Lagarias Method. 


There is something nearly magical in associating with an arithmetic se- 
quence, as simple as a second-order integral recurrence, a number in (0, 1) 
which measures the size of its set of prime factors. People, mathematically 
versed or not, have often reacted with some curiosity at being told that on 
average, 17 out of 24 primes divide a number of the form 2” +1. We have al- 
ready seen that although defined as additive objects, Lucas sequences possess 
remarkable multiplicative features. 


Even though density results are asymptotic by nature, experiments with 
prime divisors of linear recurrent sequences frequently show the proportion of 
divisors from a small collection of primes lining up closely with the asymptotic 
density value. This can provide a way of checking the calculated density as 
well as sometimes a chance at guessing its value, if the density is a simple 
rational number. 


We provide a couple of tables, Tables 9.4 and 9.5, which illustrate this 
phenomenon. The first row gives the number n of primes tested for division. 
(We always tested the least primes p1,...,Dn.) The corresponding count of 
primes dividing V is indicated as cy. The last row gives the number of primes 
expected to divide if the asymptotic proportion were to hold for the finite set 
{Biysesaba fe 


+ Lagarias rediscovered the method as he was initially unaware of the work of Hasse until 
Jean-Pierre Serre informed him of it (private communication). 
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Table 9.4 Prime divisors of (3% + 1) and of the sequence (Lz) of the Lucas numbers 


n 30 | 60 | 90 | 120 | 150 | 180 | 210 | 240 | 270 | 300 |... | 900 
c(3z41) | 20] 39] 59] 78 | 98 | 121 | 139 | 157 | 176 | 198 |... | 601 
cL 19 | 39] 58) 82 | 99 | 121 | 141 | 163 | 185 | 206 |... | 599 
‘Expected’| 20 | 40 | 60 | 80 | 100 | 120 | 140 | 160 | 180 | 200 |}... | 600 


Table 9.5 Prime divisors of (2% + 1) 


n 24 | 48 | 72 | 96 | 120 | 144 | 168 | 192 | 216 | 240 |... | 960 
Car41 15 | 32 | 49 | 65 | 84 | 103 | 117 | 135 | 153 | 167 |... | 682 
‘Expected’| 17 | 34 | 51] 68 | 85 | 102 | 119 | 136 | 153 | 170 |... ] 680 


We now prove the Hasse density result for d(V) when V,, = a"+1 and a is 
a square-free integer such that |a| > 1. The set of odd primes is the disjoint 
union of all PJ, 7 > 1, where P) = {p: p= 1+ 2% (mod 2/*")}. By the 
Dirichlet density theorem (9.4), d(P?) = 1/y(2/*1) = 2-4. Observe that the 
densities d(P’) sum up to 1. The Hasse method evaluates within each P’ the 
proportion of primes that divide no term of V. Put N; = {p€ P;: pt V} 
and D; = {p€ P;: p| V}. Suppose p{ a and p € P!. Let p be the order of a 
(mod p). The number p is also the rank of appearance of p in U(a+1,a). As 
seen in Theorem 19, p | V if and only if p is odd. Thus, p+ V iff a®—))/?’ = 1 
(mod p), since 2) is the full power of 2 in p— 1. Therefore, because Fy, is 
cyclic, a is a 2/-th power in Z/pZ. Hence, the polynomial x?’ — a has a root 
in Z/pZ. In fact, this polynomial factorizes completely modulo p since Z/pZ 
contains 2/-th roots of unity. By Remark 42, p = 1 (mod 2/) implies that p 
splits completely in Q(¢2;). By the Kummer-Dedekind theorem, any prime 
ideal above p in Q(¢,;) must split completely from Q(¢;) to Q(@;, */a). 
Therefore, p splits completely in the field F; := Q(¢2;, %/a). It does not split 
completely in K; := Fj(¢gi+1) because p # 1 (mod 2/+1), By the Kronecker- 
Frobenius density theorem, N; has density: 


1 1 
[Fi:Q [kj:Q 


Suppose a is not +2. Then, \/a is not in Q(¢p;). (The only quadratic number 
fields lying in Q(¢g) are Q(i), Q(V2), and Q(./—2).) Therefore, the degree of 
F; over Q is 


d(Nj) = (9.9) 


[Q(¢as) : Q] - [Fj : Q(Coi)] = 27-7 - 2? = 47/2. (9.10) 


Moreover, [K; : Fj] = 2. Hence, d(N;) = 2-4-3 —4-9 = 474. The density of 
primes not dividing V is 
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Thus, the complementary set, the set of prime divisors of V, has density 2/3. 
To be rigorous, we should only have concluded that the lower asymptotic 
density of non-divisors, d(N), is at least lim, ee d(N;) = 1/3 since N 
contains Ul_,N; for all n > 1. But each Dj, being the complementary set 
of N; in P;, has density 2~/ — 4~7. Since D contains U?_, D; for all n > 1, 
d(D) > lim, ae d(D,;) = 2/3. Therefore, N, the complementary set of D in 
P, has upper asymptotic density d(N) bounded above by 1/3. The conclusion 
follows. 

Suppose a = +2. Then, ¢g = (\/a + i\/a)/2. Therefore, Fy = Q(i, Wa) 
contains ¢g. As a result, Fy = Ky and, by (9.9), d(N2) = 0. Also, the degrees 
of F; and K,, for j > 3, have been halved since Q(¢;) now contains /a. 
Therefore, 


1 1 1 1 1 1 t 
day) 7 gt 0+ Daly) a a 5 Gl —47 34 — 2a? 
j21 j23 j23 
and the density of prime divisors of (a” + 1) is 
2 _ 7 17 
d(2” +1) = d((—2)" +1) =1- 54 OA! (9.11) 


(1/24)-th more than 2/3. 


When the zeros a and 8 of x? — Px + Q are quadratic irrationals, the 
same method applies, but the density computation is done in two separate 
steps. On the one hand, one computes the density of primes that split in 
Q(a) and do not divide V; on the other, the density of primes inert in Q(a) 
not dividing V. For the latter subset of primes, we fix j > 1 such 2/ ||p+1 
(rather than p — 1; for inert primes the rank of p divides p+ 1) and evaluate 
for each such 7 the density of primes not dividing V much in the way done 
for p — 1. See [35] and [13, Chap. 6]. 

Lagarias enunciated sufficient conditions for an integral irreducible second- 
order linear recurring sequence X = (aa™ + Ga"), to fall under the Hasse 
method: if there exists y € Q(a) such that 


cy, (9.12) 


Qi1 8 
II 
Lt 
TT 
a) 
© 
3 
a 
Qala 
II 


where ¢ is a root of unity in the field Q(a), k = 1 or 2, and j is an integer, 
then one should be able to calculate unconditionally the prime density of 
X = (aa” +4"), by the Hasse-Lagarias method. (Here, @ and @ denote the 
algebraic conjugates of, respectively, a and a within Q(a).) 


Companion Lucas sequences satisfy the Lagarias conditions (9.12); take 
7 = a/a = a?/Q,k =1, 7 =0 and ¢ = 1. But there are integral second-order 
recurrences which are not companion Lucas sequences and satisfy (9.12). In 
fact, Lagarias [35] worked out the density for such a sequence. However, we 
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will observe in Sect. 9.7 that any recurrence satisfying (9.12) must be a torsion 
sequence in the Laxton group associated with the characteristic polynomial 
of the recurrence. 


As mentioned earlier, the densities of companion Lucas sequences were 
fully described in [5, Thm. 3.1.3] in the case of reducible characteristic poly- 
nomials. No such description exists for all cases of irrational roots. How- 
ever, Moree [42] gave the prime density for all companion Lucas sequences 
V(P, —1). In these cases, V, = €” +€", where € = (P+ VP? + 4)/2 is the fun- 
damental unit in Q(/P? + 4). But Moree and Stevenhagen [44] went further 
and worked out the densities of companion Lucas sequences (e" + é”), for all 
fundamental units € of real quadratic fields Q(Vd), (d > 1). If the norm of 
e is —1 and d > 2, then the density is 2/3. If the norm of € is 1, the density 
is 1/3, unless (€ + €)/2 +1 is a rational square. In 2007, Ljujié [39] wrote a 
Master’s thesis under Stevenhagen’s direction, where, in particular, d(V) is 
shown to always be a rational number in (0,1), and the values of d(V) are 
given under various conditions on the irrational root a. Ballot [12] gave a 
proof that for almost all pairs of parameters (P,Q), d(V(P,Q)) is equal to 
2/3 in the sense that the set of exceptional pairs (P,Q), |P| < x and |Q| < z, 
has cardinality o(x?) as 2 —> +00. 


When the zeros a and £ of x? — Px + Q are rational integers, finding out 
the density of prime divisors of companion Lucas sequences boils down to 
discovering the density of primes p for which the order of a fixed rational 
number r modulo p is even. Considerable progress [52, Theorem 1.3], [43] has 
been made in the larger question of finding out uniform asymptotic estimates 
in m and x, where m is a positive integer, for the proportion of primes p < x 
for which the order of r modulo p is divisible by m. We will refer to this type 
of question as the order divisibility problem, or simply as the order problem. 
Hasse [31] had already considered the case when r is a rational integer and m 
is a prime number. Wiertelak [80], in the late 1970s, found some asymptotic 
estimates for the order problem when m is an exact prime power. To give an 
idea of the type of more advanced knowledge we have, here is a proposition 
[15, Lemma 2] based on results in Pappalardi’s 2003 paper [52]. 


Proposition 187Suppose a > 1 is an integer, but not an integer’s k-th 
power for any k > 2, and m is an odd positive integer. Let €,(p) be the order 
of a modulo p, where p { a is a prime number. Define A = {p ; m | £,(p)}. 
Then, for all € > 0, the estimate 


#4(2)= (am+0( A) Ya, 


holds uniformly in m and «x with 


1 i 
dm = as II C= (q prime). 


q|m 


9.5 The Hasse-Lagarias Method 241 


As the concept of the rank of a prime in a fundamental Lucas sequence 
U generalizes the order of a rational number, finding out the proportion of 
primes for which m | py(p), m a positive integer, is a natural generalization 
of the order problem. The successful study of the case when x? — Px + Q has 
irrational zeros came years later. Several 1998 conjectures in the Fibonacci 
case U = F,, made on experimental grounds by Bruckman and Anderson [24], 
concerning the frequency with which pp(p) is divisible by a given positive 
integer m, or is divisible by an exact power of a prime, were proved in 2014 
by Cubre and Rouse [25]. The first conjecture stipulated that the set of primes 
p for which m | pr(p) has the density: 


2 
Cm qd F 
dy, = eae | | Pot’ (q prime), (9.13) 


q|m 


where c,, is a correction factor equal to 5/4, if m = 10 (mod 20), 1/2, if 
20 | m, and 1, otherwise. Thus, for instance, primes with a Fibonacci rank 
divisible by 30 have density: 


5 1 25 


dso 24° 16 24° 


Jb hee HS 5 SO 
4 30 3 ° 8 84° 43°38" 
An important contribution to the nondegenerate irreducible case was made 
by Sanna [59] in 2020; he obtained an asymptotic formula for #A(x) in the 
form d,,7(a), plus an error term which depends on U and m. Here, m is an 
odd positive integer (not divisible by 3 in case Q(a) = Q(/—3)) and A is the 
set of primes p for which m | pyu(p). Then, 


2 
Cm q . 
dy, = ne | asa (q prime), (9.14) 
q\m 


where 
Cm = ged(m®, h)~! + nu(m), 


h is the largest exponent such that a/6 = y", for some y € Q(a), and ny (m) 
is 0, most of the time, i-e., if D > 0, or Do #1 (mod 4), or Do { m™, but 
depends explicitly on gcd(m®, h) and the square-free part Dp of D = P?—4Q, 
otherwise. Here, m°° is the formal product hele: p~. Thus, if x is an integer, 
then gcd(a,m°) is the product of all prime powers p”, where p | gcd(x,m) 
and p” || x. 

The particular cases when Q(a) = Q(./—3) and m = 3 or 6 were dealt with 
in another context [13, Theorem 6.9]. The “usual” densities in these instances 
are d3 = 3/4 and dg = 1/2. The densities dz and dy when Q(a) = Q(V—1), 
Q not the square of an integer, nor twice such a square, are d2 = 5/6 and 
d4 = 2/3 [13, Theorem 6.2]. 


242 9 Density of Companion Sequences 


Remark 44 Before closing this section, we point out that much work has been 
done on prime divisors of the general integral second-order linear recurrence. 
In 1921, Pélya showed [54] the linear recurrence (c,a” +c2b"),», has an infinite 
number of prime divisors, if a, b, ci, cz are nonzero integers, and rfr3 # 1 
for all x and y in Z, unless « = y = 0, where r) = a/b and rg = —c2/c1. 
In 1954, Ward [77] showed any integral second-order nondegenerate linear 
recurrence has infinitely many prime divisors. Some of these sequences are so- 
called “torsion” sequences — see Sect. 9.7.1, and companion Lucas sequences 
are among them. But for the general non-torsion second-order recurrence, 
following Hooley’s idea on Artin’s conjecture, density results, conditional on 
the truth of generalized Riemann hypotheses, were obtained by Stephens [67] 
in 1976, and, in 2000, were refined and corrected by Moree and Stevenhagen 
[45]. For the general non-torsion second-order linear recurrence X = (cja” + 
cob”), a, b integers, the prime density of X, is expressed in the form 


dO) = Oe: I (1- = :) meres (9.15) 
p prime 
where the above product S, over all primes p, was introduced by Stephens and 
C'x is a positive corrective constant that depends on X, which often lies in a 
small interval around 1 (see [45, p. 303]). The value of the Stephens constant S 
is approximately 0.57596. Moree and Stevenhagen [46] challenged themselves, 
with success, to find, under Riemann hypotheses, the prime density of the 
second-order linear recurrence Y, mentioned by Lagarias [35, p. 451], with 
Yo = 3 and Y; = 1, which satisfies the same recursion as the sequence LD of 
Lucas numbers, i.e., Yn42 = Yn41 + Yn, (n > 0). Note that Y has irrational 
roots, but their result is again expressed in the form d(Y) = Cy - S, where 


Cy = 2B ~ 1.0026, and S is the Stephens constant. 


9.6 A Heuristic View 


Why is it that primes that divide even-indexed terms V2, are so often twice as 
many as primes dividing odd-indexed terms V2,41? Even though we have the 
Hasse-Lagarias method and a precise argument to say that for most choices of 
pairs (P,Q) this phenomenon holds, one cannot but feel that simple intuition 
is eluding us on why it should be true. Shouldn’t they split equally between 
the two categories? We recall John Selfridge at the WCNTC,' probably in 
the late 1990s, with his distinctive red sweater, saying during one of his 
interventions, with a grin on his face, that he had an idea why this held. 
Quizzing him later on his remark didn’t bring an “ah, hah” moment. The 


+ The West Coast Number Theory Conference; it was held at Asilomar, a conference center 
near Monterey, California, as we recall in that instance. 
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question lingered and we felt this was a later heuristic view, somewhat in the 
way Daniel Shanks (see [62, 63] and [64, pp. 80-83]) used to proceed, which 
brought about a satisfactory intuitive answer. 


A first simple version of these heuristics was presented in [9, Sect. 5]. If a 
is a positive integer and d, is the density of primes p for which the order of 
a (mod p) is odd, then, using, say Theorem 3.1.3 in [5], it is a simple matter 
to prove that limy N~' 37, <,<y da = 1/3. Perhaps as a nod to Wang-Wei’s 
quotation that started the chapter, we may reverse the point of view. Fixing a 
prime p, the heuristic argument evaluates the proportion of positive integers 
a whose order modulo p is odd, to find again an expected mean of 1/3. These 
heuristics were pursued and used in Sects.2 and 3 of [12] for general and 
specific sets of primes p defined by 2 + py(p). In particular, the case U = F 
was treated. They were later applied [13, Sect. 6.3] to the cases m + py(p) 
when 1. D = —E?, m = 2 and 4, and 2. D = —3F?, m = 2, 3 and 6, where 
F and F are integers. 


Let us look at the simplest case: prime divisors of (a” + 1), where a € 
Z\ Z?,a#—1,a# +2. We may ignore the finitely many primes that divide 
2a. Hence, we assume a € Z}. The heuristic follows the Hasse method. We 
evaluate the density of primes p for which the order h, of a (mod p) is odd. 
Partition the primes into the sets P? = {p: p= 1+ 2) (mod 2/*1)}, for 
j => 1. The heuristic part of the argument consists of assigning a probability 
P; for a random prime p in P? not to divide (a” + 1). If p € P’, then the 
number of 2/-th power residues in ZF is (p—1)/2/, i.e., the order of the group 


(g?’), generated by g?’, where g is a primitive root modulo p. Assuming a 
(mod p) to be a random element in Z>, as p varies in P!, its chances of hitting 
a 2/-th power residue are 2~J(p — 1)/(p — 1), that is, 2-7. Note that h, is 
odd if and only if a (mod p) is a 2/-th power residue modulo p. Hence, we 
put P; = 274. Since the weight of P! in the set of all primes is given by its 
density, which, by the Dirichlet theorem, is 2~4, the heuristic predicts that 


d(o? +1) =1- Slaps). Py =1- oad a1 5 = 4, 


j21 j21 


which is the density proved by Hasse. The heuristics [12, p. 135] for a = +2 
have to be modified because for 7 = 2, p= 5 (mod 8) and a is necessarily a 
quadratic nonresidue of p, whereas if j > 3, then p = 1 (mod 8), and a, on 
the contrary, is necessarily a quadratic residue of p. 


Let us try to work out these heuristics for the set of primes p whose 
Fibonacci rank pr(p) is a multiple of 10. By (9.13), these primes have density 
4° 33. Here, F = U(1,—1), a and § are the zeros of x? 2-1, D=5 
and V = L, the sequence of Lucas numbers. Suppose p = +2 (mod 5). By 
Theorem 9, p | p+ 1. But 5 { p+ 1. Hence, 5 { p. Similarly, if p = —1 
(mod 5), then 10 { p(p). Indeed, p | p—1, but 5 { p—1. It remains to examine 


primes 1 (mod 5). Suppose p = 3 (mod 4). By Lemma 15, p | Lip—1);2- 
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Thus, by Theorem 19, 2 | p. Therefore, 10 | p if and only if 5 | p. Let 
Pi = {p; 5’ ||p—1}, j > 1. By the Dirichlet density theorem, P3 has density 
4/p(57+) = 5-9. Assume further that p € P2. Then, 5+ p iff (a/B)®-Y/® = 
1 (mod p), which occurs iff a/f is a 5’-th power modulo p. The proportion of 
5/-th power residues modulo p in Z* is 57. Assuming —a* = a/f (mod p) 
is as likely as any other member of Z>, to hit a 5/-th power residue as p varies 
in Pi , we expect a density of 


dl 4a(PZN{p; p=3 (mod 4)}) 5-4 = 58 x 46) 


j21 j21 


of primes p = 11 (mod 20) such that 5 { p(p). It follows that primes 11 
(mod 20) with 5 | p have density: 


a 1 5 

8 BB (9.17) 
Consider now primes 1 (mod 4), that is, primes 1 (mod 20). Thus, we are 
dealing with a set of primes of density 1/8. Suppose 2? || p—1 for some j > 2. 
Then, p(p) is odd iff (—a?)®-)/?’ = 1 (mod p). This occurs iff —a? is a 
2/-th power residue modulo p. The number of 2/-th power residues in Z? 
is (p — 1)/2?. However, since —1 is a square modulo p, —a? is a quadratic 
residue modulo p. The odds of —a? hitting a 2/-th power residue are then 
2-I(p — 1)/2-1(p — 1) = 2!-9. Primes p = 1 (mod 5) with 2/||p — 1 have 
density 4~!- 2-4. Therefore, we expect primes 1 (mod 20) with odd rank to 


have density: 
. 4d 1 
g-@4+5) . 91-5 = A Ae 
x 52 


j22 j22 


This is saying that the probability of primes 1 (mod 20) to have odd rank 
is 1/3. Thus, their probability of having even rank is 2/3. Assuming the 
independence of the two events “p has even rank” and “p has rank divisible 
by 5,” we find that 10 | p with probability 

255 


= - prob(5 |p, ifp=1 (mod 20)) = ao o 
The 5/6-th probability above comes from carrying out the very same calcu- 
lations that led to the densities in (9.16) and (9.17), only replacing primes 
3 (mod 4) in (9.16) by primes 1 (mod 4). Thus, the density of primes with 
rank divisible by 5 among primes 1 (mod 20) is 5/48. This yields the claimed 
probability of (5/48)/(1/8) = 5/6. Since primes 1 (mod 20) have density 1/8, 
our heuristics give the density: 
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for primes 1 (mod 20) and rank divisible by 10. Summing 5/72 and 5/48, we 


get a density of 
5 (1 1\ 25 
24\5 ° 2) 65247 


the correct density! Some readers will surely enjoy finding similar heuristics 
for m = 20. 


As seen in the above, these heuristics only use the Dirichlet density theo- 
rem, the cyclic structure of Z>, and some simple equidistribution assumptions. 
In Sect. 9.8, we will see that comparable heuristics apply to finding the den- 
sity of sets of primes that divide higher-order linear recurrences, but where 
division is defined in ways other than the usual one. 


Remark 45 If U(P, Q) is a regular Lucas sequence, then, extending the result 
of Matiyasevich and Guy [41] for the Fibonacci sequence, Kiss and Matyas 
[33] proved that 

log |[U1, U2,..., Un]| 6 


lim A = >, 9.18 
peeeo log | []j,=1 Us nm ( 
where [x, *,...,*] stands for the least common multiple. We may view (9.18) 


as another manifestation of the strong divisibility property of a regular 
U(P,Q). From [20], the above ratios, with the terms U;(P,Q) replaced by 
V;i.(P, Q), have a limit equal to +. Perhaps, this result is to be put in relation 
with the common prime density d(V) = 2/3. Note that, if instead of terms of 
Lucas sequences U or V, one puts [17] sequences of type a” — ¢,8", where 
Gm is a primitive m-th root of unity, with m = 3 or 6 if Q(a) = Q(V—3), 
and m = 4, if Q(a) = Q(./—1), one also obtains limits for the corresponding 
ratios, namely, 135/(1677), 234/(2577), and 80/(977), when m is 3, 6, and 


4, respectively. 


9.7 The Laxton Group and Its Generalizations 


9.7.1 The Laxton Group 


In 1969 and 1970, Robert Laxton published two rich papers [37, 38] that 
dealt with integral second-order linear recurrences and their prime divisors. 
He constructed a semigroup structure F'(f) on the set of linear recurrences 
that share a common characteristic polynomial f of degree two. The product 
in this semigroup has the property that it preserves division by primes. That 
is, if p is a prime, p| X and p| Y, then p| X * Y, where X * Y represents 
the semigroup product of the two sequences X and Y. By “identifying” two 
sequences if they differ merely by an index shift, or if they are rational scalar 
multiples of one another, then one gets a set of equivalence classes, G(f), 
on which the semigroup operation in Ff) is well defined and makes G(f) a 
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group. Moreover, division by primes is still preserved by the group operation. 
A prime p divides a class in G(f) if there is a sequence X in that class 
such that p | X,, but p { Xn41, for some index n. This division of X is 
called proper division. Suppose X € F(f). If p| X, then either p is a proper 
divisor of X or it is a null divisor, i.e., p divides all terms from some point 
onward. The distinction between the usual division and proper division is 
minimal as Ward [76, 77] showed an X in F'(f) possesses at most a finite 
number of null prime divisors. The fundamental Lucas sequence U(P,Q) 
and its companion V(P, Q) have a central place in the group G(f). Indeed, 
U(P, Q) is the identity element of F(f) and the class of V(P,Q) has order 2 
in G(f), and this class is, for most f(x), the only nontrivial torsion element in 
G(f). Many subgroups and quotient subgroups were studied and identified in 
[37]. The last pages of [37] and the second paper [38] are devoted to properties 
of torsion elements, and a complete description of T(f), the torsion subgroup 
of G(f). 


Besides Laxton’s papers, partial descriptions of the Laxton group or refer- 
ences to its structure have appeared in various places in the literature [5, pp. 
15-18], [26, Sect. 1.1.7]. More recently, several mathematicians from Japan 
have worked on finding group theoretical descriptions for the structure of the 
Laxton group [2, 3, 69-71]. Nevertheless, we briefly present, in the Laxton 
manner, this group here again. 

Given f(x) = 2? — Pr+Q €Z[z], PQ 40, P? 4 4Q, Laxton defined the 
set F'(f) of nongeometric linear recurring sequences (X,,), which have integer 
terms from some point onward and satisfy: 


X42 — PXp41 _ QXy. 


If a and # are the zeros of f(x), then there are nonzero algebraic integers A 
and B such that Aa” — BB? 
an — 
Xx, = —————, 9.19 
4 (9.19) 
for all integers n. We find that A = 2, — 496, B = x; — roa and write the 
sequence X as (A, B). The semigroup operation * in F'(f) is defined by 


X « X' = (AA’, BB’), (9.20) 


if X = (A, B) and X' = (A’, B’). As easily verified, the initial values yo and 
yi of the product sequence Y := X « X’ satisfy 


Yo = Lox, + xx — Pxox5, (9.21) 
Y= 124 _ Qxoxg, 

where 2, #1; and 2, x are the respective initial terms of X and X’. Thus, 

AA’ = yi — yo8 and BB’ = y; — you. One readily sees from (9.21) that if 

p | x and p | x, then p | yo. (In fact, if in addition p { x and p{ x}, 
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then p{ y,. Thus, proper division is preserved in F'(f), and, thus, division of 
classes in G(f) is also preserved.) 


Two sequences X and Y in F'(f) are defined to be equivalent, and we write 
X ~Y, if and only if there is an integer s and a rational number r such that 


In =TYnt+s; 


for all indices n. The semigroup operation * is well defined on equivalence 
classes of sequences. Moreover, G(f), the set of classes, together with the 
operation * derived from *, forms an abelian group. The inverse of the class 
of (A, B) is the class of (B, A). One may observe that (B, A) belongs to F(f) 
since B is 


© — Zoe = £1 — Zo(P — B) = (x1 — Pap) —(—x9)8, (or (—Qaz_1) — (—29)8). 


By (9.19), we have U = (1,1), and, by (9.20), the class of the fundamental 
Lucas sequence U is seen to play the role of the identity element of the group 
G(f). Also, V = (P — 28, P — 2a) = (a — 8,8 — a). Hence, 


V«V =(D,D) =D(1,1)=D-U, 


and we see that the class of V is a torsion element of order 2 in G(f). 

In [35], Lagarias computed the prime density of three second-order linear 
recurrences: (2” + 1), (Z,) and a certain sequence (W,,). Their respective 
prime densities were 17/24 (as first shown by Hasse), 2/3 for the Lucas num- 
bers as mentioned in Sect. 9.5 and 3/4 for (W,,). The first two are companion 
Lucas sequences; the third one is not. However, (W,,) has order three in the 
Laxton group associated with f(a) = x? — 5a + 7. This is readily checked: 
Wp = 1 and W, = 2; the zeros of f(x) are (5+/—3)/2. Thus, if W = (A, B), 
then A = 2— 6, where 6 = (5 — /—3)/2. Therefore, 

5— /-3 1+ 7-3 
2 


= ¢3, 


the third root of unity e?'"/?. Hence, 
W*xW*W =(AS, B3) = ee (G@)*) = (1, 1) =U (6, 7). 


In fact, to date, all second-order linear recurrences for which the prime 
density is unconditionally known all belong to the torsion subgroup T(f) of 
some G(f).' The group G(f) is infinite, while T(f) is finite. In fact, T(f) 
often consists solely of the classes of the fundamental and the companion 
Lucas sequences U(P,Q) and V(P,Q). 


+ However, with the use of generalized Riemann hypotheses, one can find the prime density 
of other second-order recurrences — see [45, 46], or Remark 44 of this book. 
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We saw earlier that Lagarias gave sufficient conditions (9.12) for a second- 
order linear recurrence to fall under the Hasse-Lagarias method. It was shown 
[5, pp. 20-22] that the class of a sequence obeying the Lagarias conditions 
must be of finite order in G(f); in fact, its order must divide 12. Given 
f = (a —a)(a — @), the set L(f) of classes of recurrences which satisfy 
the Lagarias conditions (9.12) forms a subgroup of the torsion group T(/f). 
(There are polynomials f(x) for which L(f) is a proper subgroup of T(f); 
see [5, Prop. 2.4.5].) 

When the root field Q(a) contains cube roots of unity, and hence sixth 
roots of unity and D = (a — a)? € —3Z?, T(f) contains at least a cyclic 
subgroup of order six. Similarly, if Q(@) contains fourth roots of unity and 
(a—a@)? € —Z?, then T(f) contains a cyclic subgroup of order four. Williams’ 
paper [82] — unrelated to Laxton’s point of view — develops properties of the 
sequences of order three or six when Q(qa) contains e?'*/%, in particular, 
properties leading to primality tests 4 la Lucas for numbers of certain types. 
Ballot’s long paper [13] studies the six “torsion” sequences when D = —3F?, 
and the four ‘torsion’ sequences when D = —E?, F and E integers, from 
many points of view. The purpose was to bring convincing evidence that these 
second-order integral linear recurrences ought to be viewed as purebred Lucas 
sequences (i.e., authentic companion Lucas sequences only of orders 3, 6, and 
4 instead of 2 for (V,)). That is, for discriminants of the form D = —E?, 
there are four Lucasian sequences G, H, V, and U, where Gp = Ho = 1, 
G, = (P+ E)/2, H, = (P — E)/2; for D of the shape —3F”, there are six: 
Y, Z, S,T, V, and U. The (W,,)-sequence, of which Lagarias computed the 
3/4-th prime density, is an S, or a T, sequence (depending on the sign given 
to F’, since So = Tp = 1 and Sj and T} are (P+ F’)/2). We have Yo = Zp = 1 
and Yi, Z, are (P + 3F)/2. For instance, most of the important formulas 
given in Chap. 2 involving U and V have analogues involving G and H, or 
S,7T,Y,and Z. Many arithmetic properties have their counterparts too. To 
give only a few examples, the double-angle formula (2.23), i.e., Van = UnVn, 
has the counterparts [13, Eqs. (2.28), (2.61)]: 


Usn = 2UnVnGnHn if D = —B. 


td 


Usn = 3UnVnSnTaYnZn, if D=—3F?. 


The Euler criterion for Lucas sequences (Theorem 36) has analogues that 
decide when a prime p divides X(p_¢,)/4, for X = U,V,G or H, if D = —E”, 
or when p | X(p—¢,)/3, X = U,S or T, if D = —3F?. We saw in Theorem 19 
that prime divisors of the V sequence are those with an even rank py. When 
D = —E”’, the G and the H sequences share the same prime divisors by 
Laxton’s semigroup law: the class of one is the cube of the class of the other, 
and the group law preserves prime division. Except for primes dividing 2Q, 
prime divisors of G, or H, are the primes with a rank py divisible by 4. 
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Likewise, when D = —3F?, primes with a rank py divisible by 3 are the 
divisors of the S and the T’ sequences; those with a rank divisible by 6 are 
the divisors of the Y and the Z sequences, which are of order 6 in the Laxton 
group. As mentioned in Sect.5.8, the fact that the sum eer it is zero 
modulo p? when p is maximal has analogues involving the pairs (G, H), (S, Z), 
and (T,Y), all of which differ by a factor of V with respect to the Laxton 
group law. For instance, say f has discriminant D = —E?. Then, we have 
V«G = E-H ~ H, where « is the semigroup operation in F(f). Ifp{QF isa 
prime of maximal rank p = py = p—€p, then Speed 7, 18 zero modulo 
p*, where py is the least positive h such that p | Hp. Since ¢, = (—E? |p), 
the rank py exists because 4 | p—€,. We saw in Sect. 9.5 that in some precise 
sense, for almost all choices of parameters (P, Q), the prime density of V(P, Q) 
is 2/3. Results in the same vein have been shown for the V and G sequences 
which, for almost all discriminants of the form —E?, have respective prime 
densities 5/6 and 2/3. Similarly, the V, S, and Z sequences have respective 
prime densities 2/3, 3/4, and 1/2, for almost all discriminants of the from 
—3F?; see [13, Chap. 6]. 

Given f(r) = 2? — Px+Q in Z[z], Laxton [37, Thm. 4.4] proves that T(f) 
contains three elements of order two if and only if Q = R?, where R € Z. 
The two other elements besides the class of V(P, Q) come from the sequences 
A = (An)n>0 and B = (B,)n>o0 with respective initial values 1 and P + R. 
The four classes of U, V, A, and B thus form a copy of the Klein group. 
Laxton observes [37, Thm. 4.6] that 


Uon+1 = An : Bn, 


and that the sets of prime divisors of A and B are disjoint. It follows that the 
sets of prime divisors of the three order-two sequences A, B, and V form a 
three-block partition of the set of all primes,’ the primes of odd rank dividing 
either A or B, those of even rank dividing V. 

There are cases when each of the three blocks of the partition have equal 
prime density, i.e., a density of 1/3. For instance, if ¢ = a + bVd is a fun- 
damental unit of norm —1 in the ring of integers of a real quadratic field 
Q(Vd), (d > 3 squarefree), then the A, B and V torsion sequences associated 
with a? — (4a? + 2)x +1 = (x — €?)(x — @) has each prime density 1/3; see 
[14]. Up to a constant, we find that Ay, = Van4i(P,—-1), Bn = Uon4i(P, -1) 
and V,,(P? + 2,1) = Van(P,—1), where P = 2a. Let p be the rank of primes 
with respect to U(P,—1). Then, primes dividing B have odd rank p, primes 
dividing A satisfy 2 ||, and primes dividing V(P? + 2,1) have a rank p with 
2-adic valuation greater than or equal to 2. For € = (1 + V5)/2, Vinson [74] 


+ but possibly the prime factors of Q 
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observed that if Ty is the period of the Fibonacci sequence modulo a prime 
p, and pr(p) its rank in the Fibonacci sequence, then Tr = mr(p)- pr, where 
the multiplier m(p) may only take the values 1, 2, or 4. Prime divisors of 
the three order-two torsion sequences A, B, and V with respect to x? —32+1 
have the alternate respective characterizations mp(p) = 1, mp(p) = 4 and 
mp(p) = 2, each set having density 1/3 [14]. In particular, the prime density 
of the Lucas numbers (L,,), which Lagarias [35] proved to be 2/3, is the set 
of primes satisfying mp(p) = 1 or 2. 

If Q is a square and D = —3F7?, then the group generated by the classes of 
A and one of the order-six sequences, say Y, is a subgroup of T'(f) isomorphic 
to Zz x Ze. If instead D = —E?, then T(f) contains a subgroup isomorphic 
to Zo x Za. 

Let us examine an example in more detail. For f(x) = 2? —2x+49, we find 
that D = —3-8?. Thus, F = 8 and the group (A, Y) is isomorphic to Zz x Ze. 
In Table 9.6, for each of the 12 sequences X, we counted the number of 
primes p, when p varies from p; = 2 to poo9 = 1223, which divide at least one 
term among X2, X3,...,Xp41. Below, we indicate our (educated) guess for 
their prime density (something that can be checked using the Hasse-Lagarias 
method and the Laxton group properties to alleviate some of the work). Since 
Y and Z have respective initial values 1 and (P43F)/2, we computed the two 
initial values, using equations (9.21), for each sequence. We got U = [0,1], 
¥ = [Ls 8 = [L5) @ YY?) V = 22) ey, FS [1a]  Y*), 
Z = [1,—-11] (~ Y®), A= [1,-5], Ax Y = [6,-114] =6-[1,-19], Ax 9 = 
[—2,-—74] = —2- [1,37], Ax V ~ B = [1,9] (we used B instead of Ax V), 
AxT = [-10, —34] = —2- [5,17] and Ax Z = [-18,6] =6- [-3, 1]. 


Table 9.6 Twelve torsion sequences and their prime divisors 


Sequence U ¥ S Vv T Z 

# Prime divisors 199 49 149 67 149 49 

Conjectured prime density] 1 1/4 3/4 1/3 3/4 1/4 
Sequence A AxY|AxS|AxV|AxT|]AxZ 

# Prime divisors 69 50 52 63 52 50 

Conjectured prime density] 1/3 1/4 1/4 1/3 1/4 1/4 


Using a probabilistic terminology, we may guess from the data that the two 
events “2 | p” and “3 | p” are independent. Indeed, one-third of the primes 
divide V. Three-fourths of the primes divide S, i.e., have a rank divisible by 
3. Now 2| p and 3| p <=> 6] p, and we note that the product 1/3 x 3/4 
yields the “probability” 1/4, which is the prime density of Y, i.e., of primes 
whose rank is divisible by 6. 
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9.7.2 Generalizations of the Laxton Group 


There is a generalization of the Laxton group, discovered in [5], which is 
well-suited to prime maximal division.' 

Suppose X = (X;,)n>o is an integral linear recurrence with monic charac- 
teristic polynomial f(x) in Z[a] of degree m > 2. Let p be a prime number. 
Then, p is a (prime) mazimal divisor of X if and only if there are exactly 
m — 1 consecutive terms of X divisible by p, i.e., if and only if 


p| Xi, t=n, nt+l1,...,n+m—2, but pt Xn+m-1, (9.22) 


for some index n. 

The concept was introduced by Ward [78] in 1954 and generalizes the 
notion of proper divisor for m = 2 mentioned in the previous subsection. 

The fundamental recurrence U with Up = Uy =... = Um_2 = O and 
Um_—1 = 1, which is an extension of the fundamental Lucas sequence to all 
m > 2, obviously has every prime as a maximal divisor; in fact, it is known 
(60, 66] that all recurrences in F'(f) with a density of prime maximal divisors 
equal to 1 are in the class of U. The generalized Laxton group for maximal 
division will have the class of U for its identity element. Classes are defined as 
previously, i.e., two sequences in Ff) belong to the same class if some index 
shift and multiplication by a rational number applied to one sequence yields 
the second sequence. As noted by Ward [78, Eq. (2.4)], all integral linear 
recurrences X with characteristic polynomial f, assuming f has m distinct 
ZEYOS Qi, Q2,---,;Q@m, have the form: 


m Qn 
i= a, 2 
are (9.23) 


where f’ is the derivative of f and the A;’s are algebraic integer constants in 
the root field of f. The set F'(f) is limited to the integral linear recurrences 
of exact order m, ie., satisfying []j”, Ai 4 0. 

Representing the sequence X in (9.23) as (Ai, Ag,..., Am) and assuming 
Y = (Bi, Bo,..., Bm), we define the operation * on F'(f) by 


X*xYor= (A, By, Ag Bo,...,AmBm). 


The product sequence X * Y is an integral linear recurrence, and (F(f), *) 
is an abelian semigroup with identity. The identity is the fundamental recur- 
rence U which turns out to be (1,1,...,1) [5, pp. 76-77]. The sequence 


¥ In 1994, one of the authors, C. Ballot, was invited to Cambridge, UK, by R. Pinch to give 
a couple of lectures on the material developed in [5]. The stay was memorable in several 
ways, including the experience of some of the traditions unique to the place. R. Laxton 
had taken the train from Nottingham to attend the first lecture. He was overjoyed that 
some then-young mathematician had at last done something with his early discoveries. 
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where Bj = []iciem;i¢j Ai, (1 < 7 < m), belongs to F(f), and its class is 
the inverse of the class of X = (Aj,...,Aj;,...,Am) in (G(f), *), where * is 
the well-defined operation: Class of X * Class of Y = Class of (X *Y). 

It was shown in [5, §5 Chap. 5] that when f(x) has m nonzero rational 
roots, then G(f) contains a subgroup M(f) isomorphic to Z3’~'. Hence, 
there are 2™~1 — 1 distinct class recurrences which are torsion of order two. 
(If m = 2, then M(f) is simply the group with two elements generated by 
the class of the companion Lucas sequence.) 

A notion of rank, p, OY Pmax, Of maximal division for a prime p is now 
introduced. It is, if it exists, the least positive index n such that 


p|U;,i=n, n+1,...,n+m-—2, but pt Unsm—1, (9.24) 


where U is the fundamental recurrence. By definition, maximal division of X 
occurs at n iff 


p | Xn, Pp | Xn+1; +++, D | Xn+m—2) but pt{ Xntm-1- (9.25) 
Maximal division of X = (Aj,...,Am) occurs at n iff 
Aja} = Aga =---= Anan, (mod (p)), (9.26) 


where (p) is the ideal generated by p in the ring of integers of the root field 
of f, and the congruence holds in that ring. 

It is not difficult to see that p |maz U at n if and only if p | n. Thus, given 
an integer s > 2 finding out the prime density, assuming existence, of 


{p; $ | Pmax(p)} (9.27) 


is a reasonable extension of the order problem mentioned in Sect. 9.5. 


Particular attention was given to the cubic case in Chap. 4 of [5]. In that 
case a maximal divisor divides two consecutive terms of the sequence. Given 
an abelian group (G,*) and an element a in G, one can easily construct an 
isomorphic group (G,+) with identity element a. It suffices to define the 
operation + by r+y=a*y*a_!, for all x and y in G, where a! is the 
inverse of a in (G,*). Using this remark, it was shown that we may favor 
other kinds of division than maximal division. For instance, we may favor 
the notion of twin division. Say p is a twin divisor of X if and only if there 
is some n > 1 such that 


Pp | Xn—1, DP | Xn+1, but pt Xn. 


It was proved that the aforementioned isomorphism from the generalized 
Laxton group (G(f), *) to (G(f), +) using a = T, where T is the recurrence 
with cubic characteristic polynomial f(a) and initial values [0, 1,0], has the 
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additional property that it transforms maximal divisors of a given element 
into twin divisors of its image. Thus, in particular, any density computation 
of the set of maximal divisors of a given recurrence (or equivalently of its 
class) corresponds to the density of twin divisors of its image through the 
isomorphism. 


The above generalization of the Laxton group, we recall, was done as- 
suming the m zeros of f(x) to be distinct. Another relevant group [6] was 
discovered in the case f(x) has exactly one double zero. The nth Cullen num- 
ber, C,,, is defined as C,, = n2"+1. Robinson [56] showed all Cullen numbers 
Ch, 2 <n < 1000, were composite, except for Ci4,. Here is, in our context, a 
striking property of the Cullen numbers, not unrelated to their high level of 
compositeness. Suppose p is an odd prime. Then, by Fermat’s theorem, we 
see that 


Cp-2 = (p— 2)aP-2 +4 =-2P 14+1=0 (mod p), 
Cp-1 = (p—1)2?-1+1=-1+1=0 (mod p), (9.28) 
Cp = p2?+1=1 (mod p). 


That is, every prime p > 3 is a maximal divisor of the third-order recurrence 
(Cn)n>o0- Yet, it is not equivalent, in the Laxton sense, to the fundamental 
recurrence U with initial values [0, 0, 1]. Suppose f(x) = (~—a)?(a—B) € Z[z] 
and write f(x) = (a — a)(a — 8) for the squarefree part of f. Then, there is 
a generalized Cullen-number recurrence (C's(7))n>0, namely, 


7 n 28 -—a A B a 
A oe io 


in F(f) which satisfies, for all primes p { a8(a— 8), 


Cy(p— 2) =Ce(p—1)=0 and Cy(p)=1 (mod p). (9.30) 


It turns out that by broadening, or, as one may prefer to say, by weakening 
the notion of equivalence used by Laxton, we obtain a relevant group G(f), 
in which the generalized Cullen-number recurrence and the fundamental re- 
currence belong to the same class. Incidentally, it remains true that only the 
identity class contains recurrences with a density of maximal prime divisors 
equal to one. Moreover, there is an isomorphism from G(f) to the Laxton 
group G(f1) which sends all maximal prime divisors to proper divisors of the 
image. If f(x) = (a —1)(a — 2), then the class of the sequence (n2” — 1)n>0 
is sent onto the class of the companion Lucas sequence (2” + 1)n>0, which 
was seen in Sect.9.5 to possess a 17/24 density of prime divisors. Hence, 17 
out of 24 primes on average divide two consecutive terms of (n2” — 1). We 
do not know whether relevant group structures exist whenever there are two 
double zeros, or one triple zero. 
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Recently Suwa [72] wrote a preprint reformulating in terms of affine group 
schemes the aforementioned isomorphism between G(f) and G(f1). 


9.8 The Density of Maximal Prime Divisors 


It is possible to extend the Hasse-Lagarias method to compute the exact 
prime density of maximal divisors of some torsion recurrences of the gener- 
alized Laxton group G(f), where f € Z[z] is monic of degree m > 3. The 
Hasse-Lagarias method can also solve, in some cases, the generalized order 
problem as stipulated in (9.27). As mentioned in Sect. 9.7, if the roots of f are 
nonzero distinct integers, then the torsion subgroup T(f) of G(f) contains a 
subgroup M(f) isomorphic to Z™~+. The generic maximal prime density of a 
recurrence whose class belongs to an order-two element of M(f) is 2/(2™—1) 
(see [5, Prop. 5.5.5]). Note that for m = 2, we recover the common 2/3-density 
of a companion Lucas sequence. 

If f(x) = (w—a)(a— 8)(a—y) = 23 — Px? + Qu — R, where a, B and 7 are 
three distinct nonzero integers, then the three order-two recurrence classes 
in M(f) have for representants Vi = [2,2a, Pa — By], V2 = [2,28, P68 -— ay], 
and V3 = [2,27, Py — aG]. They can be written respectively as (—A, A, A), 
(B,—B, B), and (C,C,—C), for some nonzero integers A, B, and C. Using 
(9.23), we easily see that p is a maximal divisor of V; iff for some n > 1: 


(3) =(2)= oman 


which can be translated into the equivalent condition 


V2(h1) = V2(h2) > Is 


where h, and hg are the orders of a/ and a/y modulo p. The above con- 
ditions permit one to start a Hasse-Lagarias process. If a; # +r?a,; or 
a; # +2r?a,;, where a;, a; are two of the zeros a, B, or y, and r is a rational 
number, then each V;, 7 = 1,2 or 3, has a density of maximal divisors equal to 
2/7; see [5, Prop. 4.6.4]. The three sets of maximal prime divisors of the V;’s, 
1 <i < 3, are disjoint. Hence, their union has density 6/7 under the above 
conditions on the zeros of f. This union is also {p; 2 | pmax(p)} providing 
an answer to a case of the generalized order problem (9.27). One may refer 
to [5, Thm. 5.6.5] for some answer to the generalized order problem, when 
S=q, a prime, and f has degree m > 2 and m distinct zeros in Z. 


Thus, the density of primes dividing two consecutive terms of (5"+2-3”—1) 
is 2/7 ~ 0.286; it is 65/224 ~ 0.290 for the recurrence (3" — 2"*1 — 1), where 
one zero is twice another. This is to be compared with the fact that the prime 
density of (2” + 1), which is 17/24, is a notch higher than that of (3” + 1), 
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which is 2/3. In fact, 17/24 and 65/224 obey the same formula: a ‘oS 
for m = 2 and 3, respectively. 


These densities lend themselves well to the type of heuristics encountered 
in Sect. 9.6. We illustrate this with two examples. 


Heuristics for Two Cubic Recurrences 


1. Let M, = 5” + 2-3" —1 for n > 0. We present a simple probabilistic 
argument that yields a prime density of 2/7 for the set 


A= {p; pt 30, p| ged(Mn, Mn+1) for some n}. 
Let p be a prime in A. Then, p divides M, and M,,+, if and only if 
P| Mn4i1-—3M, and p| Mn i1—5Mp. 


But Mn41 — 3M, = 2-5" +2 and Mn41—5M, = —4-3" 44. Thus, pe A 
if and only if there is an n € N for which the system (S),) of exponential 
congruences 


(Su) : 2 - E ae (9.31) 


is solvable. Let h, and hg be the respective orders of 5 (mod p) and 3 
(mod p). Then, (9.31) is solvable if and only if hy and hg are both even 
and n is an odd multiple of both h1/2 and h2/2. But these conditions hold 
exactly when we have 


V2 (hy) = V2(h2) = 1 


Thus, A = {p; p{ 30, vo(h1) = ve(hg) > 1}. Let us fix 7 > 1 and evaluate 
the probability P; that a prime p satisfying v2(p — 1) = j be in A. For any 
k, 1 <k <j, vo(h1) =k if and only if 5 is a 2—*-th power residue, but not 
a 2/+!~*_th power residue (mod p). The proportion of such power residues 
in (Z/pZ)* being equal to 


[(p- 1)/29-* -~(p— 1)/27+3-*] /(p - Hie goin 1th 


the probability that v2(h,) = k is assumed to be 2-J~1+*. The same assump- 
tions are made if we replace 5 by 3. Therefore, we have: 


r= 3 Prob, (p—1)=3{¥2(h1) = k} x Prob, (p—1)=;{V2(ha) = k} 


k=1 

u k u th 1 
(Q-7-**) 4+—_(1-49 

DA = a ) 

k=1 t= 1. 


The density of primes d; satisfying v2(p — 1) = j being 2~J, our expected 
density for A is obtained by summing over j 
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So dj x Pj = 3|D2?-bs"| a 


j21 j21 j21 


ae cn ane 


‘)=3 6-2 


2. Let us consider the cubic recursion T, = 3°+! + 2"+3 — 5, n > 0. We 
now present a heuristic yielding the prime density of the set 


B={p; pt{30, p| gcd(Tn, In+2) for some n}. 


Recall that if a prime p divides both the n-th term and n + 2-th term of a 
cubic recursion (X,,) for some n > 0, then we say that p is a twin divisor of 
(Xp). 

Since T,,42 — 4T, = 15(3" + 1) and T,,42 — 9T, = 40(—2” + 1), a prime 
p lies in B if and only if the system of exponential congruences 


(Sr) : es 7 Sree) (9.32) 


is solvable in n. Letting h1 and hg be the respective orders of 2 and 3 (mod p), 
(Sr) is solvable if there is some odd multiple n of h2/2 which is also a multiple 
of h,. This holds exactly when v2(h2) > 1+v2(h1). Fixing 7 > 1, let P; denote 
the probability that v2(h2) > 1+ 2(h,) as p varies through primes satisfying 
V2(p — 1) = 9. Thus, 


j-1 
P; = )_ Prob{v2(h1) = k} x Prob{v2(h2) > 1+ k}. 
k=0 


Hence, P, = Prob{v2(h1) = 0} x Prob{ve(h2) = 1} = 1/2 x 1/2, since we 
expect 2 to be a quadratic residue of p for half the primes p = 3 (mod 4), and 
we expect the same proportion holds true for 3. For 7 = 2, p = 5 (mod 8) 
so 2 is a quadratic non-residue of p. Therefore, 12(h1) = 2 and P: = 0, since 
V2(h2) cannot exceed 2. For j > 3, 2 is a quadratic residue of p. Now, for 
k,0<k < j-—1, %2(h1) =k implies that 2 is a 2%-*-th power, but not a 
2)-k+1_th power (mod p). The proportion of 2/~*-th power residues which 
are not 2/-**+1_th power residues among quadratic residues is 


(p= 1/2?" —@— 1/2") | (@—D/2) 2°", 


for 1 <k<j-—J1. And for k = 0 this same proportion is 


((p —1)/2) / ((p— 1)/2) = 2°97", 


since V2(p—1) = j implies that there are no 2/*1-th power residues (mod p). 
Thus, we assume Prob{v(hi) = k} to be 2-/+! if k = 0, and 2-/+* if 
1<k<j-1.A similar argument gives: 


eee i 
Prob{ve(h2) 21 +k}= 5+ G+-7-4 eee a aa 
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Therefore, we evaluate P;, j > 3, as 
j-1 2 
2-941 x (1-2-4) 4 $0 2-G-4%) x 1-2 GO) = F144), 
k=1 3 


after summing and simplifying. The expected prime density of B is given by 
the sum 


yo k= 


j21 


1 . 2 
gtr) *xgx(l—4 J) 
J23 

P.2 2 27 1 

8 

7 


TS 4 3° 8 72078 
_ 1 195 «65 
~ 94 24x28 24x28 224° 


As mentioned earlier, there is a corresponding cubic linear recurrence in 
F(f), where f = (a — 1)(a — 2)(a — 3), which has B as its set of maximal 
prime divisors. 


Early experimental data suggested that when f is a cubic polynomial with 
three linear factors over Z[x], the general non-torsion cubic linear recurrences 
in G(f) have very few maximal prime divisors; see the first two tables in [5, 
p. 44]. Partial results showing, indeed, this density to be 0 were achieved in 
[15, 16]. The two above heuristics — see Eqs. (9.31) and (9.32) — show a prime 
p is a maximal, or a twin divisor, of a cubic recurrence if and only if a system 
of two congruences of the shape 


a” =b (modp) and c”=d (mod p), 


is solvable in n. In particular, in [16], the case where the rank of the group 
(a,b,c,d), generated by a, b, c, and d in Q*, is at most 2, is examined in 
detail. 

If f(z) = 23 — Pa? + Qu — R = (x — a)(x — G)(a — c), where a is a 
quadratic algebraic integer, @ is its algebraic conjugate and c is a nonzero 
integer, then only one of the three recurrences (V;) in M(f) has rational 
integer terms, namely, the recurrence with initial values T = [2, 2c, Pc— ad]. 
The class of T has order 2 in the generalized Laxton group G(f). If c = 1 
and a is a fundamental unit of norm —1 of a quadratic field Q(Vd), d > 3, 
then T, = 1+ Un,-1 + Un, where U is the Lucas sequence U(a + @, —1). If 
a = (14+ V5)/2, then T, = 1+ F,41 and its density of maximal divisors, i.e., 
of primes p for which two consecutive Fibonacci numbers are congruent to 
—1 modulo p, was computed in [7]; it is equal to 2/3. (It is the set of primes 
for which the multiplier mp(p) is either 2 or 4 — see the end of Sect. 9.7.1.) 


As explained in Sect. 6.4, if D is a non-cube integer and 6 = WD, raising 
a fundamental solution a = X; + Y,6 + Z,6? of the cubic Pell equation 
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a + Dy? + D?z? —3Dzyz = 1, (9.33) 


to the n-th power, n € Z, yields all integer solutions of equation (9.33). The 
algebraic conjugates of a are 8 = X, + Yjw?d + Z,w6? and £, the complex 
conjugate of 8, where w = e"/3, Put f(x) = (a — a)(x — 8)(a — B) € Za]. 
Define (X;,), (Yn) and (Z,,) by 


Xn + 6¥, +02, = 0”, 
for all n. Then, we find that 


3Xn =a" + 6” +B, 
36), = of + oO" pw" 8, 
30°2Z, =a" +a 8" +wAe”, 


ie., we get, up to a factor of 3, a set of Williams’ sequences. These sequences 
were the object of the doctoral thesis [81] and were discussed in Sect. 6.5. 

Remarkably, it was shown in [8], given M > 1, ¢€ = +1, D= M3+«€ #0, 
where it is known that a = M?+M6+6? is a fundamental solution to (9.33), 
that writing the recurrences (3Xy_1), (3Yn—1) and (3Z, 1) in their standard 
form (9.23) yields: 


(67, 6?w, 62w7), (6,dw?, dw) and (1,1,1). 


This readily shows that the class of X is a torsion element of order three 
in the generalized Laxton group G(f). This class generates a group of order 
three containing the class of Y, also of order three, and the class of Z, the 
identity element. 

The density of maximal divisors of X, or of Y, is generically equal to 
51/104, ie., a tiny bit less than 1/2.‘ The ranks pmax of all prime maximal 
divisors of X, or Y, are multiples of 3, but there are primes not dividing X 
or Y such that 3 | pmax. In fact, it was shown that the set {p; 3 | pmax(p)} 
generically has the density 157/312, i.e., slightly more than 1/2. This is the 
case, for instance, when M = 7 and a = 4+2V/7+ W49. The method of proof 
is again an extension of the Hasse-Lagarias method, with the caveat that for 
companion Lucas sequences (V(P, Q)(n) = a" +B"), where P? — 4Q is not a 
perfect square, the Hasse-Lagarias method required computing separately the 
density of prime divisors according to whether they split, or remained inert 
in Q(./ P? — 4Q). Here, the method computes separately three cases: primes 
2 (mod 3), which split into three prime ideals in Q(d,w); primes 1 (mod 3) 
such that D is not a cube modulo p which split into two prime ideals; and 


+ As noted by H. Roskam (private communication) and, later, P. Moree (Mathscinet Review 
of [8]), there are exceptions, not pointed out in [8], where the density differs from the generic 
value quoted herein. 
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primes 1 (mod 3) for which D is a cube modulo p which split into six prime 
ideals in Q(6,w). 
There may be other irreducible cubic polynomials: 


f(x) =a° — Pa’ + Qu — R= (x—a)(x — B)(z@— 7) € Ze] 


such that G(f) contains torsion elements for which the density of maximal 
divisors may be calculated through a similar method, and perhaps, other f’s 
for which (a” + 6” + y”) is also torsion of order three. Much remains to be 
discovered. 


= 


9.9 Irreducible Polynomials Dividing (X"+ 1), in Fy[X] 
Lucas sequences are sequences of rational integers, but many of the identities 
they satisfy would hold in other rings than the ring Z. Indeed, if R is a ring 
(with 1) and P and Q are nonzero elements of R, then one may define the 
pair of Lucas sequences U(P,Q) and V(P, Q), as we did in Chap. 2, by their 
initial conditions Up = 0, U; = 1; Vo = 2, V; = P, and the common recursion: 


Xn+2 = PXp4+1 a QXn, 


for all n > 0. 

Their arithmetic properties could offer some variations worthy of curiosity, 
or of interest. If p is a prime and e > 1 an integer, then we denote the field 
with ¢ = p* elements by F,. A few arithmetic properties of Lucas sequences 
in the ring F, were studied in [48]. To our knowledge, no detailed study of 
the arithmetic of Lucas sequences in the ring F,[X], of polynomials over F,, 
has been undertaken. 

However, the rings Z and F,[X] share much in common. Both are eu- 
clidean rings. The q — 1 elements of Fj replace the two invertibles +1 of Z. 
Monic polynomials correspond to the positive integers N. Monic irreducible 
polynomials in F,[X] correspond to prime numbers in N. Thus, we will often 
use the term primes in F[X] for monic irreducible polynomials. The number 
of primes in F,[X] of degree n, (n > 1), is denoted by I,,. It satisfies the 
well-known formula: 


1 nm 
In = — Duda’, 
d\n 


where yz is the Mobius function introduced in Sect. 4.3, and the sum is over 
all positive divisors d of n. It is an easy matter to verify than I, ~ q”/n, as 
n — oo. This asymptotic equivalence between I,, and q"/n is usually referred 
to as the prime number theorem (PNT) for the ring F,[X]. Indeed, q” is the 
number of monic polynomials of degree n in F,[X], and n = log,(q”). Thus, 
the quotient g”/n is an analogue of the classical PNT, m(a) ~ x/log x. The 
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norm |P| of a monic polynomial P of degree n > 1 is the cardinality of the 
quotient ring F,[X]/P, i.e., |P| = q”. Thus, there is a definition of Dirichlet, 
or analytic density analogous to (9.2). If S is a set of primes in F,[X], then 
S' is said to have analytic density 6 = 6(S) € [0,1] if and only if the limit 


Pl? 
fn eres IPI 
esi Joe (Pl 


exists and is equal to 6. There are, in the F,[X] context, analogues of 
the Dirichlet, the Kronecker-Frobenius, and the Cebotarev density theorems 
which ensure sets of primes possess an analytic density as defined in (9.34). 
The notion of natural density which seems to be used most often is the fol- 
lowing. If S Cc I, where I is the set of primes in F,[X], and the limit 


(9.34) 


_ On 
Jim 7 (9.35) 
exists, then it is the density of S. Here, S, is the number of primes in S 
of degree n. In [9], the density (9.35) is referred to as the d,-density of S. 
Perhaps, one compelling reason why d is often considered a sensible analogue 
of the usual natural density of rational primes is that d,(S) exists if and only 
dz(S) exists, where d2(S), provided it exists, is defined as 


N N 
slim, (>: s,) / @ ) Jt (9.36) 


n=1 


The d2-density is cumulative, whereas d, is local to degree n primes. But 
because of the fast growth of [,, with n, the two densities d; and d2 are 
equivalent in the Murata sense — see the end of Sect.9.1; refer to [9, Prop. 
1.8] for a proof of the equivalence between d; and dz. According to Bilharz, 
a hint that the dj-density is not an appropriate analogue of the classical 
natural prime density, came from Davenport. Bilharz proves [21] that the set 
of primes P in F,[X] having X as a primitive root, i.e., such that the order 
of X in the multiplicative group of the finite field F,[X]/(P), is p’ —1, where 
P has degree n, does not have a d2-density. In the classical setting, primes 
having a given integer a as primitive root are, under Riemann hypotheses, 
known to possess a natural density [32]. Thus, Bilharz’s result came as a 
surprise. 

Consider the companion Lucas sequence V(X + 1,X) = (V,,) in Fy[X] 
with initial values 2 and X + 1 and defining recursion: 


Vin+2 = (X +1)-Vn41—-X-Vn, (n > 0). 


+ Another reason is that the PNT in F,[X] only considers primes of degree n. See also 
[57, Thm. 9.13B] for a version of the Cebotarev density theorem valid with respect to 
d,-density. 


9.9 Irreducible Polynomials Dividing (X” + 1) 261 


It turns out [10] that the Hasse method is valid in this context, and the 
analytic densities of prime divisors of the sequence of polynomials V,, = X"+1 
can be computed for all prime powers q. For instance, if g = 3, we find that 


d({P prime; P| V,,, for some n}) = 17/24, (9.37) 


where P | V,, means that there is a A € F,[X] such that P-A = V,,. But these 
sets won’t generally have a d2-density. In [9], a detailed discussion led to the 
conclusion that another notion of density, called d3, was a better analogue 
of natural density than d;, or dz. A set S C I possesses a d3-density, d3(S), 
equal to 


2 


N- oo 


lim N71 we (9.38) 
1 n 


n 


provided this limit exists. In fact, it is possible [10] through an elementary 
method, which does not use the versions of the Kronecker-Frobenius or Ce- 
botarev theorems in F,[X], to see directly that prime divisors of the V- 
sequences, such as (9.37), have a d3-prime density. Hence, by (9.37), primes P 
such that the order of X modulo P in F,[X] is odd have d3-density 7/24. As in 
the classical case, a numeric check is feasible. Thus, 7/24 ~ 0.2917, and it was 
calculated [10, Rmk. 4.6] that ri, ~ 0.2868, where ry = N7!0*_, S,,/In 
is the N-th approximant to the d3-density. Note that since, in the Murata 
sense, we have [9]: 
6 ~ dz ~ do, 

the elementary method implies the analytic densities of those sets. Primes P 
in F,[X] for which X is a primitive root of P do not have a d-density, but 
they do have a d3-density [9, pp. 322-324]. 

Suppose @ is an odd prime number. If a is an integer, not an ¢-th power in 
Z, then the classical Hasse method ([31], [5, Thm. 3.2.3]) yields the natural 
density ¢/(¢€? — 1) for the set of rational primes: 


{p; £ divides the order of a (mod p)}. (9.39) 


(For @ = 2, we often have a 2/3-density which the same formula ¢/(¢? — 1) 
provides correctly.) 

The elementary method of [10] was extended further [11] to show the set 
of primes, analogous to (9.39), i.e., 


{P € F,[X]; @ divides the order of X (mod P)} (9.40) 


; (1 ea) 


has a d3-density equal to 
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where @ is an odd prime, f is the order of g (mod £) and v = vp(qf — 1). We 
note that if qg is a primitive root modulo @ and v = 1, then the above density 
yields, as in the classical case, ¢/(¢? — 1). 


The papers [9-11] have barely opened the door to the problem of determin- 
ing the prime density of companion Lucas sequences in F,[X]. For instance, 
one may ask whether it is possible to extend the elementary method that 
proved prime divisors of V(X +1, X) have ds-density to the companion se- 
quence V(X,—1)? Note that the discriminant associated with V(X + 1, X) 
is (X — 1)?, but the discriminant of V(X,—1), D = X? +4, is not a square 
in F, [2], if g=p°, p > 3. For p= 2, D=(X +:2)7,) 
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Chapter 10 
Epilogue and Open Problems 


Study hard what interests you the most in the most undisciplined, irreverent and 
original manner possible. (Richard Feynman) 


Il passa sa licence a la faculté des sciences de Montpellier, sans que ses professeurs 
le remarquent (ni du reste qu’il remarque ses professeurs). (Laurent Schwartz a 
propos de Grothendieck) 


Abstract We conclude with a brief review of the topics introduced in this 
book and Lucas’ connection to them. This is followed by an eclectic assort- 
ment of unsolved problems. 


10.1 Epilogue 


As we have seen in Chap.1, Lucas claimed that a reason for his develop- 
ment of the theory of his sequences was their possible utilization in a proof 
of Fermat’s last theorem. Undoubtedly, this played a significant role in his 
interest in these sequences, but his own work strongly suggests that the main 
impetus for this research evolved from his fascination with the Fibonacci se- 
quence. The progression of his papers clearly displays how his discovery of 
the properties of the Lucas sequences developed from his empirical study of 
the Fibonacci numbers. He summarized much of this work in the first part of 
[9] and more extensively in his memoir [10], the main body of which appears 
in a substantially extended and revised form in Chap. 2, which is devoted 
to a number of basic results concerning the Lucas sequences—some of these 
were known to Lucas, but many were not. Indeed, this entire book can be 
considered as a greatly expanded version of [10]. 


At the beginning of his inquiries, Lucas was particularly interested in how 
the theory of his sequences could be applied to the problem of primality 
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testing, particularly to the Mersenne numbers. Later, he extended these ideas 
to numbers of the form Ap” +1, where p is a prime. This is discussed in some 
detail in Chap.3, as are a number of other applications of his sequences, 
extending even to cryptography. In view of Lucas’ interest in this subject 
[17, Sect. 15.3], he would likely have found this development very gratifying. 


As mentioned in Sect.4.1, Lucas discovered the connection between his 
sequences and the circular and hyperbolic trigonometric functions. This is 
discussed at greater length in Chap.4 (and Chap.6) as are several other 
features of his sequences. In particular, we provide a correct proof of his Fun- 
damental theorem in Sect. 4.3. On p. 291 of [10], he introduced the concept 
of a proper divisor of (U,,). This is a prime which divides some U,,, but does 
not divide U,, for any positive m < n. This, of course, is what we call a 
primitive divisor today. Lucas believed that if D > 0 and n is sufficiently 
large, then U,, would possess at least one primitive divisor, but as we saw in 
Sect. 4.5, it wasn’t until much later that Carmichael was able to prove this. 
In his attempt to extend some of Lucas’ tests for the primality of Mersenne 
numbers to exponents of the form 4k + 1, Lehmer was frustrated by the fact 
that D cannot be 2 or 3 modulo 4; this caused him to devise what are today 
called the Lehmer sequences. These sequences are defined and some of their 
properties are given in Sects. 4.4 and 5.9. 


As we mentioned at the beginning of Sect. 5.2, Lucas [10] did not introduce 
the Lucasnomials, but made the fundamental observation that makes these 
objects integers, i.e., he showed why the product of the first n terms U; --- Up, 
of a regular Lucas sequence divides the product of any n consecutive terms 
of the sequence. He did not make any other substantial contribution to their 
study. Chapter 5 is concerned with the enormous number of developments 
concerning Lucasnomials, which, omitting work on g-binomials, should have 
started with Ward’s paper [15] of 1936, but really started with Gould [6] 
in 1969. Lucasnomials, which are a nontrivial generalization of ordinary bi- 
nomial coefficients, surprisingly share extended versions of various classical 
arithmetic properties of binomial coefficients. One of these properties, Lu- 
cas’ congruence, which is the object of Sect.5.7, was actually discovered by 
Lucas himself. Other surprises are an extended rule of Kummer that deter- 
mines their p-adic valuation and various forms of the famous Wolstenholme 
congruence modulo the square or the cube of a prime treated in Sect. 5.8. 

In Chaps. 6, 7, and 8, we dealt at some length with the problem of gen- 
eralizing the Lucas sequences. As discussed in Sect. 6.1, this problem was of 
great interest to Lucas, and certain aspects of it were examined in [10], but 
for several possible reasons, Lucas never made much headway into its solu- 
tion. We now know of several different possible extensions or generalizations, 
depending upon what it was that Lucas might have considered to be appro- 
priate. In Chap.6 we dealt with Lucas’ conviction that an extension of his 
sequences might be found by considering linear recurring sequences of order 
three or sequences of symmetric functions of the zeros of cubic polynomi- 
als. In Chap. 7, we examined the application of linear recurrences of degree 
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greater than three to this problem. Also, the Lehmer functions were general- 
ized, culminating in an algorithm which is both necessary and sufficient for 
establishing the primality of Ap” + 1, where p is a prime. As it appears that 
divisibility sequences played an important role in Lucas’ thinking, Chap. 8 
is dedicated to the application of a few types of these sequences, including 
elliptic divisibility sequences and certain linear divisibility sequences, to the 
generalization problem. 


Finally, in Chap. 9 we looked at an aspect of another problem that Lucas 
considered: the structure of the set of prime divisors of the terms of the (U,,) 
sequence. We studied this earlier in Chap. 2 and to some degree in Sect. 4.3, 
but here the emphasis is on the existence and determination of the density 
of the primes that divide the terms of the (V,,) sequence. For example, the 
density of the primes that divide the terms of the sequence (L,,) is 2/3. In 
Chap. 9, we also discussed several directions in which the techniques of Hasse 
and Lagarias for proving the prime density of the (V,,) sequences could be 
extended. For instance, one direction seeks asymptotics for the sets of primes 
whose rank in a given Lucas sequence U(P, Q) is divisible by a fixed integer 
m > 2. Another consists of discovering special cubic recurrences where primes 
that divide two consecutive terms of the recurrence have a density that can be 
computed unconditionally. Yet another direction sought a notion of “prime” 
natural density in polynomial rings over finite fields which allows analogous 
work. This density calculation was carried out successfully at least for the 
polynomial companion Lucas sequence (X” + 1)n>0 in all such rings. 


We have seen, then, that much has been learned about the Lucas sequences 
since the end of the nineteenth century. Indeed, it seems remarkable that such 
a large amount of activity has been devoted to such a simple pair of sequences, 
but still there seems to be much more to do. A glance at publications such 
as the Journal of Integer Sequences, the Fibonacci Quarterly, Integers, and 
the Online Encyclopedia of Integer Sequences (OEIS) suggests that interest 
in these sequences shows no sign of diminishing. 


10.2 Some Unsolved Problems 


In the course of writing this book, the authors encountered some problems 
concerning the Lucas sequences or their possible generalizations that are of 
some interest (at least to us) and remain unsolved or unexplored. Naturally, 
a collection of such problems, reflecting the interests of the authors, is both 
eclectic and idiosyncratic, but it is our hope that bringing them to the read- 
ers’ attention may stimulate further research. We have resisted the tempta- 
tion to include questions that, while related to certain Lucas sequences, are 
well studied and still remain unanswered; for example, the Lucas sequence 
(U;,(3,2)) is the Mersenne sequence, concerning which there are many well- 
known mysteries. 
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Fibonacci- Wieferich primes. From results in Chap. 2, it is easy to see that 
a prime p (4 5) will always divide U,_.(1,—-1) = Fy--, where e = 1 when 
= +1 (mod 5) and € = —1, otherwise. The question is: does p? ever 
divide F,_-? The Wall-Sun-Sun conjecture mentioned in Remark 12 sug- 
gests that the answer is no. Such a prime is called a Fibonacci-Wieferich 
prime. A computer search over all primes less than 2 x 10'4 conducted 
by McIntosh and Roettger [11] was unsuccessful in finding such a prime, 
and the question remains as to their existence. The authors of [11] give 
some statistical reasoning to suggest that Fibonacci-Wieferich primes 
might exist, but this kind of argument, while seemingly reasonable, does 
not take into account aspects of the structure of the Fibonacci numbers. 
See, for example, Williams [16]. This suggests the additional problem of 
investigating the properties of other Lucas-Wieferich primes. 


Common prime divisors of two companion Lucas sequences. We have 
discussed in Chaps. 2 and 9 the prime divisors of V,,(P,Q). Suppose we 
have two such sequences: (V,,(Pi,Qi)) and (V;,(P2,Q2)). Show that if 
D, Dz is not a perfect integral square (D; = P? — 4Q; for i = 1,2), then 
there must exist some n > 0 and an odd prime p such that p | V,(P1, Q1) 
and pt Vn(P2,Q2). Notice that if this is so, then we cannot have 


Vin(P1, Q1) | 2V,,(P2, Q2) for all n > 0. 


The ideas in Chap. 9 can be used to explore many instances of this prob- 
lem, but a complete answer to this question is still unknown. 


. pth power nonresidues. This problem concerns the primality testing al- 


gorithms presented in Chaps. 3 and 7 for N, = Ap” + y, where y = +1, 
p is a fixed prime, A is a fixed integer and p{ A. When J, is not a per- 
fect pth power, does there exist an efficient algorithm for finding a prime 
q such that g = 1 (mod p) and Neve # 1 (mod q) ? This question 
was discussed at some length in Sect. 16.4 of [17] and in [14]. It appears 
that such an algorithm might exist, but none is known which can deal 
effectively with all values of N/,,. 


. Solution of Gs(x) = 0 (mod r). This is another problem that arises in 


connection with primality testing of integers of the form N,, in Problem 
3. It is well known that if p is an odd prime, s = (p — 1)/2, and r = y 
(mod p) is also a prime, then the congruence 


G,(x) =0 (mod r) 
has s distinct solutions (when 7 = 1, the congruence holds for x = g+g~', 
where g is any primitive root of r; when y = —1, see, for example, 
Theorem 9.4.3 of [17]). Is it possible to characterize all the values of 
r for which there is an efficient polynomial time algorithm for solving 


10.2 Some Unsolved Problems 271 


this congruence? Some aspects of this problem are briefly discussed in 
Sects. 9.4 and 11.3 of [17], but much about it remains unknown. Of course, 
there is such an algorithm if we have a fast method for finding a generator 
of the multiplicative group of the elements of the finite field F,, where 
q =r when y = 1 and q = r? when y = —1; however, while it may be 
possible to do this for certain values of r, a general technique is not known 
and might not even exist. Several features of this problem, particularly 
when y = 1, are discussed in Moree’s extensive expository paper [13] on 
Artin’s primitive root conjecture. 


5. Absolute pseudoprimes which are also absolute Lucas D-pseudoprimes. 
This problem was mentioned at the end of Sect. 4.3. Do there exist any N 
and D such that (D| N) = e(N) = —1 and N is both a Carmichael num- 
ber and an absolute Lucas D-pseudoprime? As mentioned in Sect. 4.3, 
such values of N must be the product of an odd number of distinct 
primes p satisfying p—1|N—landp+1|N+1. 


6. Linear divisibility sequences of order three. The following problem is 
motivated by the discussion in Sect. 6.2. Classify all the third-order, non- 
degenerate LDSs (X,,). We know that there are no (X,,) when the charac- 
teristic polynomial f(x) of (X;,,) is irreducible, but what about the other 
two cases? Hall thought that Ward had disposed of one of these cases, 
but as we mentioned, there doesn’t seem to be much evidence in support 
of this. In this case we have f(x) = (a —r)(x? — sx +t), where r, s, t are 
integers and x? — sx+¢ is irreducible; we have f(x) = x3 — Px? +Qzx—R, 
where P=r+s,Q=rs+tand R=rt. It is then easy to verify that 
Xp, = U,,(s,t) satisfies the third-order linear recurrence: 


Xn43 = PXni2—- QXn41 t+ RXn, (10.1) 


but this (X,,) is of order two. Suppose that (X,,) is not a Lucas sequence 
as above. By using Hall’s [8] method, it can be shown that if (X,,) is a 
divisibility sequence, is nondegenerate, and satisfies (10.1), then s 4 2r 
and X, = (2r” — U,(s,t))/(2r — s). If t = r?, we must have Q/P =r, 
and if we put w = P+r = 2r+s, it is not difficult to verify that X,, 
must be U,(./w,r)?. Notice that U,,(./w,r)? is essentially the square of 
a Lehmer function (see Sect.4.5), and the corresponding sequence is a 
divisibility sequence satisfying (10.1). If ¢ 4 r?, then it seems that there 
are no divisibility sequences satisfying our conditions, but this remains to 
be proved. Very little is known about the case when f(a) is the product 
of three distinct linear polynomials. 


7. Linear divisibility sequences of order four. We have seen in Sect. 8.3 that 
if (X,,) is a nondegenerate LDS of degree four, then its characteristic 
polynomial must be of the form: 


272 


10 Epilogue and Open Problems 


f(x) =2* — Pa® + (R+2Q)x* — PQz + Q’, 


for some integers P, Q, R. The problem is that of determining all 
the LDSs with characteristic polynomial given by f(a). In Sect.8.5 we 
pointed out that there is always one and only one even LDS of order 
four with characteristic polynomial f(x): the standard sequence. Put 
A= P?—4R and E = (R+4Q)?—4QP? and let A = Su? and E = Gv’, 
where S, G, u, v are integers and S and G are both square-free. It was 
shown in [19] that this problem has been solved for all odd sequences 
unless 


(a) G=1, S41, 
(0) G=S#1, 
(c) G=S=1. 


In Case (a), it is strongly suspected that no such sequence exists. By 
results in [19], this would be true if Problem 2 is ever solved. An extensive 
computer search did not uncover any odd nondegenerate LDSs when Case 
(b) holds. Indeed, if we make use of the notation and results in Sect. 7 
of [18], it can be proved that a necessary and sufficient condition for Az to 
be an integer is that (X/e) | H, where H = gcd(u, v) and e = gcd(Y, 2T). 
By making use of this result, we can narrow down the possible instances 
of Case (b) where we get a nondegenerate odd LDS. In fact, it can be 
shown by a lengthy, but elementary, argument that there cannot be any 
if A = E. When Case (c) holds, we have seen in Sect. 8.5 that there are 
many odd LDSs. The question in this case is: have we characterized all 
of them? 


. Classification of all linear divisibility sequences. Problems 6 and 7 are 


subproblems of this more general problem: for a given k > 2, is there a 
scheme which can be used to find all the LDSs of order k? This seems to 
be a very deep and difficult question. Certainly, it seems difficult for just 
k = 3 or 4. The only general tool currently available is Theorem 125, but 
this is not sufficient for solving the problem. What is needed is a result 
like Conjecture 126, but this has not yet been proved, and as pointed out 
in Sect. 8.3, there are difficulties in applying it. 


Note added. It seems that very recently, Andrew Granville [7] has com- 
pletely solved this problem. For example, as a consequence of this investi- 
gation, he can show that the only nondegenerate LDS of order three must 
be either (n2a"~'), where a is an integer, or (U2), where U, = Un(/w,r) 
as in Problem 6, is some Lucas or Lehmer function, thereby providing a 
complete solution of Problem 6. This is a most remarkable solution to 
a long-standing problem, but the reader should be cautioned that, while 
the conclusions the author derives in [7] are easily understood, the tech- 
niques employed are very deep and well outside the scope of this book. 
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While [7] has yet to complete the peer review process, the ideas presented 
are convincing, and with inevitable minor corrections, the paper will very 
likely be accepted for publication. 


9. Lehmer sequences. These were introduced in Sect. 4.4. 


Using the Lehmer sequence U = U(R,Q), which is a strong divisibility 
sequence (if gcd(R, Q) = 1) and a fourth-order linear recurrence, we may 
consider the Lehmernomial coefficients 


defined in analogy with Lucasnomials. As we saw in Sect.5.9, they are 
integers. We also saw that Lehmernomial Catalan numbers 


+ 2n 
-1 
ost (™) 


are always integers and Lehmernomial Fuss-Catalan numbers are too. 
All questions about Lucasnomials covered in Chap.5 would need to be 
explored in the context of Lehmernomials, in particular finding out a 
combinatorial interpretation for Lehmernomials and Lehmernomial Cata- 
lan numbers, finding out a generalized Legendre formula, discovering a 
Kummer rule, evaluating their p-adic valuation, finding out whether they 
satisfy an extended Lucas congruence and some versions of the Wolsten- 
holme theorems. We ask a few questions in a more precise manner. 


(i) In Sect.5.5.4, we saw that all Catalan-like triples (U,a,k), i-e., such 


that the numbers 
1 (*") 
Ua- Intk \r/yu 


are integers for all n > 0, have been discovered. The complete classi- 
fication [5] of ¢-defective Lucas sequences was one of the major tools 
that allowed this search to be successful, but the same paper [5] also 
includes the classification of ¢-defective Lehmer sequences. Thus, the 
problem: find all Catalan-like triples (U,a,k), i.e., such that 


1 es 
U(a—1)n+k n a 


(ii) To date, we are not aware that the prime density of any companion 
Lehmer sequence V = V(R,Q) has ever been computed.' This should 


are integers for all n > 0. 


+ Even though this very question was asked by Alan Baker in 1994 at the end of a lecture 
of C. Ballot. 
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(iii) 


(vi) 


10. 


11. 
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be feasible because the complementary set of primes is simply the set of 
prime divisors of the second-order linear recurrence (U2n+1(R, Q))n>0- 
Of course, the more general order problem, i.e., estimating the prime 


density of sets 
{p; m | p(p)}, 


where m > 2 is a fixed integer, p is the rank of p in the Lehmer sequence 
U(R,Q), is also awaiting investigation and might be more challenging. 


Is there a Wolstenholme phenomenon for pairs of Lehmer sequences 
when p is a prime of maximal rank? That is, suppose the rank of p in 
U(R, Q) is p = p— eo, where e = (D|p), D= R— 4Q and o = (R|p), 
would we find out that some sums 


Vn 


nes Un 


are 0 (mod p?), for some sets $? Do Lehmernomials satisfy an analogue 
of either of the congruences (5.83) or (5.84), which are two generaliza- 
tions to Lucasnomials of the classical Wolstenholme congruence (5.71)? 


Is there an analogue to Theorem 88, the generalization of Lucas’ con- 
gruence of Hu and Sun, for Lehmernomials? 

Is there a relevant analogue of the Laxton group associated with a 
quadratic polynomial f(x) = «2 — /Rx+Q, R, Q integers, in which 
the Lehmer sequences would play a central role, and such that its binary 
operation would preserve division by primes? (See Sect. 9.7.) 

Develop properties of the polynomial Lehmer sequences in F,[X] asso- 
ciated with Y? - /RY + Q, where R and Q are in F,[X]. 


Divisibility sequences and Catalan numbers. Is it true that for every 


strongly divisible integral sequence (X,,), the numbers 


1 2n 
, 10.2 
Xn4+1 ( n ), ( ) 


are integral? More generally, which divisibility sequences satisfy (10.2)?! 
If the first question has a positive answer, then we would have another 
rich collection of generalized Catalan numbers. Investigation of Fuss- 
Catalan and generalized Fuss-Catalan numbers in this abstract setting 
could then follow. 


Maximal divisors of cubic recurrences. Several of the open problems we 
cite here were alluded to in Sect. 9.8. 


* A question along these lines was asked by an anonymous referee who reviewed [2]. 
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(i) 


(iii) 


(vi) 


Let f(x) = (a —1)(a? — x —1) = x3 — 2x? +1. Is it true that the only 
integral cubic linear recurrences in F(f) with positive prime density 
of maximal divisors are shifts and/or rational multiples of either (—1+ 
Fy41) or (1+ F 41)? Note that (—14+F;,+1) is the fundamental sequence 
with initial values 0,0,1, whereas [2, 2,3] = 1+ F,,41 is torsion of order 
2 in the generalized Laxton group and has density of maximal divisors 
2/3; see [1]. 

Complete the work, alluded to toward the end of Sect. 9.8, initiated by 
Ballot and Luca [3, 4], to prove, or disprove, that maximal prime divisors 
of non-torsion recurrences with characteristic polynomial f(a) = (a — 
a)(a — 8)(a — y), where a, 8 and ¥ are nonzero integers of distinct 
absolute values, have zero density. 

Can one adapt, at least in some cases, the conditional argument Moree 
and Stevenhagen used in [12|—see Remark 44—for determining the 
prime density of the second-order non-torsion Lagarias sequence (de- 
noted Y), to finding out the prime density of the maximal divisors of 
a typical non-torsion third-order linear recurrence M = (M,,)? Say for 
(M,,) defined by the recursion 


Mn+3 = 5Mn+2 _ 9Mn+1 + 5M, 


and the initial values [1,—2, 1]? 

Given f € Z[z] monic of degree > 4, is there a relevant group structure, 
G(f), with respect to maximal division, whenever f has exactly two 
double zeros, or, instead, a single zero of multiplicity 3? 


Suppose D is a positive integer, not a perfect cube, and (21, y1, 21) isa 
fundamental solution to the cubic Pell equation: 


a + Dy? + D?z4 — 3Dzyz = 1. 


Put a = 2, + dy, + 6721, where 6 = WD, and let 8 and ¥ be the 
algebraic conjugates of a. Define three integral sequences X = (2), 


a" = In + Ynd 1 eae 


Besides the family of discriminants D of the form M*?+ 1 discussed in 
Sect. 9.8, can we find other families of discriminants for which X, Y, and 
Z are torsion recurrences in G(f), where f = (a — a)(x — B)(a — y)? 
Assuming a positive answer, can we compute their prime density of 
maximal divisors? How often would the class of the sequence (a” + 
8" +") turn out to be of order three in the generalized Laxton group 
G(f)? Could its density of maximal divisors be evaluated? 


Define a primitive prime maximal divisor of U,, where U = (U,,) is a 
fundamental recurrence with characteristic monic polynomial f in Z[z], 
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(vii) 


12. 


(i) 


13. 
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as a prime which is a maximal divisor of U at n, but is not a maximal 
divisor of U at any k, 0 < k <n. Are there cubic recursions (with 
irrational roots) for which we can entirely describe the set Z of indices n 
for which there is a primitive prime maximal divisor of U = [0,0, 1] at n? 
If so, would there exist, for some of these recurrences, a positive integer 
N such that at least one of the two conditions (i) gcd(U;, Un41) < N, 
or (ii) n € Z, holds for all n > 1? 


Compute the density of primes P in F,[X] which have an even rank 
Pmax(P) of maximal division for the recursion defined by 


Yar =x -)e <1), 


and initial values 0,0 and 1. 
Can we use an elementary method which shows this set of primes has 
a dz density? (See Sects. 9.8 and 9.9.) 


Lucas theory in other rings than Z. 


Develop the theory of Lucas sequences U(P, Q), V(P, Q) in F,[X] much 
in the way it is done in Chap. 2 of this book. Here P and Q are nonzero 
polynomials in F,[X]. 

Would the study of the analogue of the Laxton group for second-order 
recurrences of polynomials in F,[X], associated with the characteristic 
polynomials Y* — PY + Q, have features of interest? 


Develop the basic theory of Lucas sequences, as in Chap. 2, in the rings 
of Gaussian integers Z[i], or Eisenstein integers Z[w], where w = e?7/9, 
or in rings of integers in general, but also in more esoteric rings such as 
Drinfeld modules, or the ring of Hurwitz quaternions, i.e., quaternions 
a+bi+cj+dk, where a, b,c, and d are all rational integers, or all halves 
of odd rational integers. Here, the three quaternions i, j, k satisfy the 
quaternionic relations i? = 7? = k? = ijk = —1. 


Wolstenholme’s phenomenon. See the remarks made at the end of Sect. 5.8. 


Find a common characterization for all pairs of second-order recurrences 
X and Y that share the same quadratic characteristic polynomial «2? — 
Px+Q € Z{a], and satisfy for all, but finitely many primes p of maximal 
rank, i.e., of rank p, or p+1 (with respect to the Lucas sequence U(P, Q)), 
the congruence 


S- —"=0 (mod p’), 


on all intervals J of maximal length with respect to the condition X,, 4 0. 
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Appendix: A Short Biography of Lucas 
with Comments on His Work and 
Scientific Legacy 


Name the greatest of all inventors. Accident. (Mark Twain) 


La chance ne sourit qu’aux esprits préparés. (Louis Pasteur) 


In what follows we provide a brief biography of Lucas, along with, in- 
evitably, our personal interpretations and commentaries. For more informa- 
tion concerning Lucas’ life and career, and how to better understand it within 
the social and political context of the time, together with his rich interactions 
with several contemporaries, we refer the reader to several papers of the late 
Anne-Marie Décaillot and her coauthors [4, 6, 8, 9], and to her very detailed 
Ph.D. thesis [7]. The collection of obituaries published in [3] also contains 
some interesting material. 


Francois Edouard Anatole Lucas was born on April 4, 1842, the eldest of 
a family of eight children in Amiens, France, an historic medium-size city 
situated some 115km north of Paris. His father’s trade was that of a cooper, 
which meant that his family was of modest financial means. His primary ed- 
ucation took place at a local parochial elementary school (’Ecole des Fréres). 
While there, he seems to have impressed his teachers as a very gifted child, 
and attracted the attention of Jacques Garnier (1808-1888), an active poly- 
math, who taught mathematics at the Imperial Lycée! in Amiens. Garnier 


+ A lycée is a secondary school for pupils aged 14-18. The “Grandes Ecoles,” Selective 
National Schools, are a system of higher education parallel to the university system, which 
was mostly put in place during the French Revolution and Napoleonic times, and continued 
to evolve in the nineteenth century. Up to the present, young people have to compete 
and obtain the best ranks in order to get into these schools; preparatory years were put 
into place within the lycées. At the time of Lucas, some lycées offered a first preparatory 
year, called a class of elementary mathematics (une classe de mathématiques élémentaires) 
at the end of which students could compete for some schools (such as Ecole navale or 
Saint-Cyr) or, enter a second preparatory year, called special mathematics (‘une classe de 
mathématiques spéciales’), which mostly prepared students for the Ecole Polytechnique 
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recommended to the local council that Lucas be awarded a municipal bur- 
sary to attend the Lycée. He got a 3/4 bursary from 1856 to 1858, which, at 
the request of his father and because of his excellent academic grades, was 
transformed into a full bursary henceforth. Thus, Lucas was able to pursue a 
secondary education, first at the Imperial Lycée in Amiens, completed from 
1859 to 1861 at the lycée in Douai, a city north of Amiens, in the northern 
part of France. In Douai he followed their special mathematics class in order 
to prepare for the contest for admission to the Ecole Polytechnique and to the 
Ecole Normale Supérieure (ENS). As was often the case, he did a second year 
of special mathematics. In fact, a waiver was required for him to compete for 
the ENS as he was too young at the end of his first year of special mathemat- 
ics. It seems, from a remark of the headmaster at Douai, that people intended 
him to pursue his studies with the ENS, from where he would be trained as 
an excellent teacher, rather than the—at the time—more prestigious Ecole 
Polytechnique. He was admitted in 1861 both to the Ecole Polytechnique 
and to the Ecole normale supérieure (ENS), which are still today two of the 
most prestigious postsecondary schools (Grandes Ecoles) in France. He de- 
cided in favor of the ENS, a school which, to this day, allocates a monthly 
state bursary to each of its attendees. After graduating in 1864, he secured 
second place in the mathematical sciences component of the competitive ex- 
amination (agrégation) conducted by the state for admission to posts on the 
teaching staff of the lycées and preparatory classes to the Grandes Ecoles. 
Incidentally, the individual who scored first place was Gaston Darboux [6] 
(1842-1917), who went on to a stellar career in mathematics (geometry and 
analysis). As Lucas was not yet 25, his official nomination as agrégé (i.e., 
someone who had successfully passed the agrégation) was delayed. 

Louis Pasteur, aware, as a number of personalities of the time, of the supe- 
riority of Germany over France in the sciences and the director for scientific 
studies at the ENS, did everything in his power to recruit and train excellent 
candidates who would both participate in the scientific life of France and pro- 
vide a high-quality education for their students, noted the inventive sagacity 
of Lucas in certain areas of mathematics. 

With the support of Pasteur! and at the request of Urbain Le Verrier, the 
Director of the Imperial Observatory in Paris, Lucas was attached to that 
establishment in September of 1864 with the title of Assistant Astronomer. 
Le Verrier was the famous astronomer who created great excitement in 1846 
when he was able to discover and locate the planet Neptune au bout de 
la plume (i.e., at the tip of his pen) based on observed perturbations on 
the orbit of Uranus. Even if we know today that the glory associated with 
this discovery should have been shared with the British mathematician and 


or the ENS, which had the same program. The vast majority of people admitted to the 
best schools did their preparatory years in Parisian lycées, but starting in the 1840s, a few 
provincial lycées had acquired a good reputation, as was the case of the lycée in Douai. 

+ From 1862 to 1866, 11 former students of the ENS were hired at the Imperial Observatory 
of Paris. 
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astronomer John Adams and that the calculations of Le Verrier were the 
object of later criticisms, one may understand that the young Lucas felt 
attracted by this opportunity. But the unpopularity of Le Verrier within the 
Observatory, especially in the late 1860s, is a well-established fact. Diverging 
views between Le Verrier and his collaborators created significant tension that 
grew stronger in late 1866. While Lucas’ professional work at the observatory 
went well, at least during the first 2 years, he left his position in late 1869 
after several stormy years of dealing with its Director, thus preceding the 
departure of a number of astronomers who resigned en masse early in 1870. 
Le Verrier’s years of dictatorial methods of management created such turmoil 
that he finally ended up being dismissed from his position in February of 1870, 
shortly before the end of the Second Empire (Dec. 1852—Sep. 1870). 


Much more information concerning Lucas’ tenure at the Observatory can 
be found in [7], about why and in what conditions, he stayed there in spite 
of Le Verrier’s wanting to fire him (and others) starting in late 1866. During 
this time, Lucas frequently sought out some escape from Le Verrier’s bullying 
by investigating mathematical problems during periodic visits to Amiens. 
Through the work of another native of Amiens, Edouard Gand (1815-1891), 
he became familiar with the mathematics associated with industrial weaving. 
His work on this formed the subject of his first publication in 1867 and several 
subsequent papers. (See [7, Chap. 7] and [8] for further material concerning 
this.) Indeed, at the conclusion of his paper [16], Lucas suggests that the 
laws of the mathematics of weaving might be enlisted in the construction of 
a mechanism to verify the primality of Mersenne numbers. 


Working at the Paris Observatory, in those days, involved a lot of routine 
work, which compounded with Le Verrier’s lack of flexibility, would have 
tested the patience and the desire to serve well of many young scientists, 
especially those trained at the ENS. Indeed, the work of an astronomer, back 
then, was essentially calculatory and mathematical: the astronomer would 
carry out by hand the necessary calculations for the raw data reduction of 
the observation; astrophysics was not yet born. An interesting anecdote in 
regard to Le Verrier’s attitude comes from Camille Flammarion’s memoirs. 
Flammarion had been recruited at the Imperial Observatory in 1858 at the 
age of 16, and he recalled the strong negative reaction of Le Verrier when he 
shared with him his intention of seeing Neptune with a telescope. Le Verrier 
would never have sought to see for himself the planet he had discovered 
mathematically! Surely the ingenuity of Lucas and his strong desire to learn 
and improve would have stumbled against the rigidity of Le Verrier. But the 
long hours spent on calculations could only have sensitized Lucas toward the 
importance, in general, of imagining efficient procedures to abbreviate raw 
calculations and make them less daunting and tiresome. The interest of Lucas 
for the practical and algorithmic aspects of computing is evident in much 
of his work. His activities around the mechanization and automatization of 
procedures constituted one of his main lifelong centers of interest. We will 
come back to this later. 
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Lucas’ career was interrupted by the July 19 outbreak of the Franco- 
Prussian War (1870-1871) in which he volunteered to serve in the Army of 
the Loire as an auxiliary lieutenant in the artillery. He took part, in particular, 
in the battles of Blois, Orléans, and Le Mans, served with distinction, being 
promoted to the rank of captain and was recommended twice for the Cross 
of the Legion of Honour, the greatest French order of merit for both military 
and civil service. During his military service, Lucas (see [20], XIV) conceived 
the problem of the “pile of cannonballs,” a Diophantine problem in which the 
solver is to find the conditions under which a pile of cannonballs with a square 
base and organized as a square pyramid contains a number of cannonballs 
which is the square of an integer. Lucas conjectured that there could only 
be 1 or 4900 such cannonballs, a result that was finally proved by G. N. 
Watson in 1919. For more information concerning this problem, see Anglin 
[1]. This simply stated problem may have kindled Lucas’ subsequent interest 
in Diophantine equations, for he deals at some length with such problems in 
[15] and [18]. 

On being discharged from the army, Lucas was faced with the problem 
of securing a teaching post. He was particularly interested in obtaining a 
position in Paris because he was working with Charles André (1842-1912) at 
the Imperial Observatory on a translation of Briinnow’s treatise on spherical 
and practical astronomy. One volume of this translation had already appeared 
in 1870, and Lucas wished to be placed in a situation that would allow him 
to continue to work on the second volume. Lucas’ contribution [20, VII] 
was to revise the instructions and provide additional tables resulting from 
his calculations. Unfortunately, Lucas’ request to be employed in Paris was 
turned down by the ministry, and instead he was appointed Professor of 
Special Mathematics (mathématiques spéciales) at the Lycée of Moulins, a 
provincial city some 300 km south of Paris, effective April of 1872. The second 
volume of the translation, with Lucas’ contributions, appeared in 1872. It was 
during his stay in Moulins that he met and married Marthe Boyron (1852- 
1882), the orphaned daughter of a Parisian lawyer; they went on to have two 
children: Paul and Madeleine. 


It seems that in Moulins, having been released from the turbulence and 
anxiety of the past several years, Lucas found the necessary serenity to pursue 
his mathematical interests. It was during this time that, very likely inspired 
by his interest in the Fibonacci sequence, he discovered the many properties 
of the Lucas sequences and applied these to the problem of the identifica- 
tion of large primes. Much more information concerning his contributions to 
primality testing can be found in Chap. 8 of [7] and in [26]. Suffice it to say 
that he was the first to recognize that certain large numbers could be proved 
prime, or composite, without recourse to a large number of trial divisions. We 
stress this was a fundamental and authentically novel discovery at the time, 
which today, as for many former fundamental discoveries, many of us take for 
granted. It is somewhat comparable to the invention of spectroscopy which 
brought about a qualitative denial to the philosopher Auguste Comte’s claim 
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that we would never know the matter of which the stars are made because 
they were simply too far away. It was also during this time that Lucas made 
his first of many successive communications [17] to the French Association for 
the Advancement of Science (AFAS). In this work he mentioned once again, 
but without providing any detail, the possibility of constructing a machine 
to determine whether or not a Mersenne number is a prime. 


As we said earlier, throughout his career Lucas was fascinated by the prob- 
lem of mechanizing the process of computation. It may well be that he had 
always had an interest in this area of research, but it is also very possible 
that the appeal of this subject began as a result of his investigation into the 
application of weaving technology to number-theoretic problems. He wrote 
a lengthy essay [18] on calculation and calculating machines for the 1884 
congress of AFAS in which he also introduced the work of Henri Genaille, a 
civil engineer working for the French railway, who was a very prolific inventor 
of calculating devices. This article was so admired that the first part of it 
was translated [19] into English for American readers. Lucas and Genaille 
also collaborated in the invention of several devices to simplify various com- 
mon types of computation; these objects were reviewed with approbation in 
Nouvelles Annales de Mathématiques [2]. Later, he encouraged Genaille in 
his construction of the Arithmetic Piano, a device which could be used to 
reduce the labor needed in using Lucas’ technique to verify the primality 
of Mersenne numbers. This instrument, which was described by Genaille to 
AFAS in 1891, is discussed in Sect. 7.2 of [26], in Chap. 12 of [7] and in [6]. 
Today there are 111 objects! associated with Lucas and 39 with Genaille in 
the Conservatoire des Arts et Métiers in Paris. 


With respect to AFAS, Décaillot [8] has pointed out that it was first 
founded in 1872 to promote the interests of French science, and a few years 
later was used as a platform by a group of mathematicians whose ambition 
was to “contribute to the diffusion of science to a wide audience.” To this end 
they dealt with mathematical questions that arose from concrete problems 
and made use of visual representations, particularly the chessboard. From 
1876 until the year of his death, Lucas was a very active member of AFAS, 
rising to the level of Chair of the sections devoted to mathematics, astronomy, 
geodesy, and mechanisms. 


Toward the end of his stay in Moulins, he submitted two theses to the 
Faculty of Science in Paris with the intention of being awarded a doctorate at 
Moulins in mathematical science. The subject matter of the first thesis ([20], 
XXIII) was geometry and of the second ([20], XXIV) was number theory— 
“the application of recurring series in the search for large prime numbers.” 
Both theses were approved for defense by the Dean of the Faculty and received 
permission for printing, but Lucas did not bother to defend them. Possibly, 


Tie, U3(11, 10) ..., an impressive number at any rate, which reveals the popular interest 
for science and technological innovation in the years 1860-1880 in France as Décaillot 
mentions [7], as much as Lucas’ involvement. 
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this was because his many pleas, since the conclusion of the war, for a position 
in Paris were finally being heard—he was offered a place in a Parisian lycée 
in the autumn of 1876. As this was the prize that Lucas was seeking, and as 
a doctorate at Moulins could only lead to a career in a provincial faculty, he 
likely felt that he should simply abandon this quest for a higher degree. 


In September of 1876, Lucas was appointed professor at the Lycée Charle- 
magne, where he taught for two consecutive years a class of elementary math- 
ematics until October of 1878; he then moved on to teaching a class of Special 
Mathematics, which covered the second part of the program for admission 
to the Grandes Ecoles, from 1878 to 1879. He was rapidly promoted! to a 
professorial position at the Lycée Saint-Louis, where he commenced teaching 
Special Mathematics in October of 1879. He was granted an extended leave 
of absence from Saint-Louis for two school years (1882-1884) as a result of 
the death of his wife of 9 years. Nevertheless, he continued his mathemati- 
cal activity during this time; indeed, he published the first two volumes of 
his mathematical recreations (Récréations Mathématiques) and devised the 
still popular game called the Tower of Hanoi, often used today as an exam- 
ple in explaining recursion to beginning computer science students. He first 
published this amusement under the pseudonym of N. Claus de Siam, an 
anagram of Lucas d’Amiens. For an extensive and well-researched discussion 
of the Tower of Hanoi, see Hinz et al. [12]. 


Lucas was described as a dynamic and enthusiastic man, who relished 
puns, a passionate scientist whose scholarship and manner charmed his audi- 
ence. Through his various activities (recreational mathematics, the works of 
Fermat, the mechanization of computing, his personal research papers, etc.), 
he was in constant epistolary correspondence with many people. His research 
output was impressive, but his teaching performance at Saint-Louis became 
a source of conflict. It is difficult to imagine Lucas, a true fountain of ideas, 
quick to establish links between different areas of mathematics, being resis- 
tant to digressions, because they did not fit with the program of preparatory 
classes. But most students at Saint-Louis wanted to perform well at the na- 
tional contests for which they were preparing. In 1890 Lucas’ originality, and 
probable lack of rigor in covering the program, possibly due in part to the 
many projects he was involved with, led to a minor revolt of several students 
at Saint-Louis. They threatened to leave the lycée if they were assigned to 
his class because they thought that his teaching style would not adequately 


+ In the sense that there always was a tacit ranking of the lycées that prepared for the 
Grandes Ecoles. The “better” lycées attracted both the better students and the better 
teachers. For instance, the headmaster of the Lycée Saint-Louis wrote [25] in his annual 
report of 1862: “M. Briot is known to be the best professor of all Parisian classes of special 
mathematics. His reputation is first-rate ...” A professor of mathematics who taught a 
class of “mathématiques spéciales” had only one class of students per year with about 
15 hours per week of in-class mathematics and extra hours for oral quizzing of students 
in small groups, usually of three. There never was any reference manual. Each professor 
designed his own course and wrote it on the blackboard with students dutifully taking 
notes for hours on. In French slang, such a student is called une taupe, i.e., a mole. 
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prepare them for the competitive examinations. The result of this was a 
transfer of Lucas back to the Lycée Charlemagne in August of 1890, where 
he remained until his untimely death in the fall of 1891. 


In 1891 Lucas completed the publication of his major work: Théorie des 
Nombres [23]. He had plans for publishing more volumes, but only the first 
volume ever appeared. This was the first book devoted to number theory to 
be published in France since the time of Legendre, almost 60 years earlier. 
Even though Lucas had some definite ideas for further material, it seems 
there were insufficient notes in his Nachlass for the production of any other 
volumes posthumously. (See [4], [7, Chap. 6], [11].) A charming and percep- 
tive review of [23] was published in 1892 by Laisant’ [13] shortly after Lucas’ 
death. Décaillot, on the fourth page of the introduction of her thesis [7], gives 
an insightful description of this singular and very original book. Today, we 
would perhaps best summarize it by saying it is an idiosyncratic collection of 
topics in Discrete Mathematics, including elementary number theory, recur- 
rences, graph theory, combinatorics, discrete probabilities, arithmetic devices, 
games, umbral calculus, the calculation of determinants, etc., nearly devoid 
of analytic methods. 


Lucas’ death came about as the result of a freak accident during an excur- 
sion in Provence, as part of the meeting of AFAS in Marseille (Sept. 18-28, 
1891). While he was dining, a member of the serving personnel dropped a 
pile of plates behind Lucas’ seat. A small piece of china flew up and caused a 
wound on Lucas’ cheek. This wound bled copiously, but was not regarded at 
the time as being of much importance. On the way back to Paris, however, 
Lucas’s cheek became inflamed and when he arrived home he took to his bed. 
He was diagnosed with erysipelas and died a few days later on October 3. 
Erysipelas is an illness of the upper dermis and by itself is rarely fatal, but one 
of its complications is necrotizing fasciitis (the so-called flesh-eating disease), 
which certainly answers to the description, une courte et terrible maladie 
supplied in the Forward of the third volume of Récréations Mathématiques 
[24], published posthumously in 1892. Lucas is buried in the Montmartre 
cemetery. (As reported by a mathematical friend, the tomb, in 1986, was in 
poor condition; Lucas’ name and that of his daughter, Madeleine, were barely 
legible, due to moss on the horizontal stone. In 2015, it was a sad sight; the 
vertical limestone had broken into pieces gathered on the horizontal stone. It 
lies close to Jacques Offenbach’s tomb.) 


+ Laisant (1841-1920) was an influential and very active personality and a close friend of 
Lucas with whom he shared mathematical interests and life experiences. He was a mili- 
tary officer, a parliamentary figure from 1876 to 1893, a freemason, meteorologist, math- 
ematician (his 1877 Ph.D. thesis was on mechanical applications of the quaternions), a 
pedagogue, president of the SMF (Société mathématique de France) in 1888, director of 
the Nouvelles Annales de Mathématique, co-founder of mathematical journals, including 
L’Enseignement Mathématiques, still active today. He “used” his political position to pro- 
mote Lucas on various occasions, e.g., as the head of a state-funded commission in charge 
of the reedition of Fermat’s work. 


286 Appendix: A Short Biography of Lucas 


During his relatively short career, Lucas wrote over 170 journal papers and 
15 other documents such as books and sundry miscellaneous monographs. 
The subjects of these works spanned areas as varied as astronomy, mathe- 
matics of weaving, geometry, analysis, combinatorics, enumerative geometry, 
recurring sequences, recreational mathematics, and calculating devices; but 
his main interest seems to have been number theory, particularly Diophantine 
analysis. In retrospect, it seems that his most important impact on mathe- 
matics was the inspired work he did between 1875 and 1878 on primality 
testing. To get some idea of the extent and variety of Lucas’ contributions, 
the reader is referred to Harkin’s bibliography [11] and to the more complete 
compilation of Lucas’ work at the website [14]. 


Unfortunately for Lucas, number theory was not in vogue in France during 
his lifetime; the subject was not taught in French universities, nor in the 
Grandes Ecoles. Number theorists, except for Hermite and a few others, were 
essentially secondary school professors and members of scientific associations, 
such as the AFAS. (See [10].) Instead, the emphasis in French mathematics 
during that period was on analysis and geometry. The importance of analysis 
is obvious in military contexts; many of the Grandes Ecoles, mainly in the first 
part of the nineteenth century, e.g., 1’X,' were also military schools, partly due 
to the influence of Napoleon. Diligent and clever attempts by C.-A. Laisant 
to establish a Chair in number theory at the Collége de France,’ proposing 
Lucas as an obvious candidate, were met with opposition from the French 
mathematical establishment and were ultimately unsuccessful. In addition 
to this national bias, Lucas’ use of elementary techniques, his emphasis on 
methods of computation, his delight in recreational mathematics, and his 
lack of an advanced degree likely united to ensure that he was relegated to 
the second tier of French mathematical scholarship. The effect of this was to 
cause his work to be largely ignored in France, until fairly recently. Perhaps 
it was this inattention which caused him to publish [20]. The name of Lucas 
in France was mostly associated with his books on recreational mathematics, 
and several of the games and themes they included. 


His most important paper [19] was solicited by the English mathematician 
J. J. Sylvester (1814-1897) for publication in the USA, where it appears in the 
very first volume of the American Journal of Mathematics. Indeed, Sylvester 
was the inaugural Professor of Mathematics at Johns Hopkins University 
in Baltimore, on the East coast of the USA, at the time he founded this 
journal in 1878. He was a correspondent of the French Academy of Sciences 
in Paris, and took part in several meetings of the AFAS. Thus, he knew 


+ The French abbreviation for l’Ecole Polytechnique. 

t A unique establishment situated in the Latin Quarter of Paris, formerly the Royal College, 
created by King Francis I in 1530, whose motto always was omnia docet, to teach every 
subject, and, in particular, subjects not taught in universities. Today it consists of about 50 
chairs, awarded to individuals at the forefront of their subject, in all areas of the humanities, 
the sciences, or the arts, who carry on research and deliver free lectures. There is a website 
and online access to these lectures. 
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Lucas well. Lucas’ creative memoir, with its theoretical, algorithmic, and 
calculatory aspects, and its numerous partial investigations got the attention 
fairly early of several American mathematicians such as E. T. Bell, D. N. 
Lehmer, R. D. Carmichael, D. H. and E. Lehmer, M. Ward, and others. 


Bell (1883-1960), a prolific mathematician and writer, who occasionally 
wrote a science-fiction novel in 3 weeks, and the author of the influential 
book Men of Mathematics, was fascinated by the works of both Liouville and 
Lucas. The first volume of the American Journal of Mathematics was left 
missing from the Library at Stanford for a lengthy period: Bell had borrowed 
it and was staying in San Francisco in 1906 when the devastating earthquake 
occurred; the volume probably burned. As mentioned in Chap. 6 of this book, 
Lucas had envisioned a connection between Fermat’s last theorem and the 
famous theorem of Jacobi concerning the impossibility of more than two 
periods for holomorphic functions of a single variable. In 1930, responding to 
a letter of Harkin on this subject, Bell wrote that he had “tried his darndest” 
to re-create this link for the last 28 years!' 


Morgan Ward (1901-1963) came to Pasadena where Bell was a faculty 
member at the California Institute of technology (Caltech) and actively par- 
ticipated in this quest. He published 33 articles on recurring sequences in his 
lifetime; many were influenced by Lucas’ memoir and were often innovative 
and beautiful contributions. D. N. Lehmer (a friend of Bell) who taught math- 
ematics at Berkeley from 1900 to 1937 and was the author of tables obtained 
by manual sieving (tables of factors; tables of prime numbers) transmitted to 
his son, D. H. Lehmer, the knowledge of Lucas and his taste for calculatory 
arithmetic and efficient algorithms. D. H. Lehmer (1905-1991), an influen- 
tial figure of the twentieth century in number theory and in computational 
methods, had two mathematical heros: Ramanujan and Lucas. The title of 
his 1930 Ph.D. thesis was An Extended Theory of Lucas’ Functions. Indeed, 
Lucas’ seminal contributions regarding factoring and primality testing were 
not truly appreciated until the 1930s in the USA when D. H. Lehmer be- 
gan the process of extending Lucas’ research. This study, especially since the 
advent of public-key cryptography in the 1970s, has since been taken up by 
many academics around the world. For details concerning the history of these 
developments and many references, see [26]. D. H. Lehmer was a close friend 
of Ward. 


Lehmer and his wife Emma initiated an informal gathering of number 
theorists on the eve of Christmas in 1969, which, today, has become an an- 
nual conference, the West Coast Number Theory Conference. At the end 
of the 1980s, it was a well-attended friendly meeting, typically attracting 
some 80 people. The Lehmers were often present; some former students of 
D. H. Lehmer (such as Brillhart, Graham, Stark, etc.), well-known or future 
well-known people were frequent participants, the likes of Bateman, Erdés, 
Granville, Lagarias, Odlyzko, Pomerance, Ribenboim, Selfridge, and many 


t See Sect. 6.3. 
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more, as well as a number of graduate students. The name of Lucas was fre- 
quently heard, mostly in relation to primality testing or to his sequences, and 
it almost seemed as if he were a great-uncle, distant, but whose deeds were 
still being recounted. 


Lucas’ name is pervasive in both the written and oral communications of 
the Fibonacci Association. The Fibonacci Association, legally based in Cali- 
fornia, was founded by V. E. Hoggatt in 1962. Besides editing the Fibonacci 
Quarterly, it has overseen since 1984 the organization of an international con- 
ference on Fibonacci numbers that meets every 2 years, alternating the host 
institution between Northern or Central America, and Europe. Many of the 
talks involve the Lucas sequences, Fibonacci or Lucas numbers, and, more 
generally, linear recurrences or special integer sequences. All talks are con- 
tributed, but starting in 2008, the Association has invited a plenary speaker, 
called the Lucas lecturer, to its conference. The first Lucas lecturer was Carl 
Pomerance (Patras, Greece); to date, the last was Florian Luca (2022, Sara- 
jevo, Bosnia-Herzegovina), and, along the way, in 2018, in Halifax, Canada, 
it was someone named Hugh Williams.’ Marjorie Bicknell-Johnson who has 
been a member of the Association since its inception has written several 
papers on its history. (See [5], in particular; it contains a list of her other 
writings on this history.) 

From a modern perspective, it is clear that Lucas was a pioneer in the study 
of efficient algorithms, particularly as they pertain to number theory and 
combinatorics; furthermore, in the pursuit of this activity, he was well ahead 
of his time. It is no exaggeration to regard him as a founder of the modern 
discipline of algorithmic number theory, and it is certainly appropriate that 
his name should be commemorated in the designation of the Lucas-Lehmer 
test for the primality of Mersenne numbers. In his hometown of Amiens, he 
is memorialized by having both a street (Rue Edouard Lucas) and a junior 
high school (Collége Edouard Lucas) named in his honor. 


Lucas had undertaken for himself the mission of reviving the triple tradi- 
tion coming from Fibonacci, Fermat (160z-1665)*, and Pascal (1623-1662), 
who, he wrote were far-sighted, when today we are so short-sighted. We recall 
that Pascal, in his youth, invented and supervised the manufacturing of what 
is often viewed as the first calculating machine, La Pascaline. Lucas might 
have added the tradition of recreational mathematics of Bachet (1581-1638), 
to which he greatly contributed with his four volumes of mathematical recre- 
ations. To him, these traditions went together and enriched each other. His 
curiosity and interest in the history of mathematics, especially in regard to 
the reedition of the collected works of Fermat, to which he contributed offi- 
cially for a few years, should also be pointed out. (See [7, Chap. 11] for letters 
exchanged between the proponents of the project and the detailed and some- 
time chaotic events that surrounded this project.) In short, Lucas (Fig. 1) 


+ Hugh avers that he did not write this paragraph. 
t Where « is either 1, 5, 7, or possibly 8. 
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was able to pursue what he liked most, with passion and no doubt joy, in 
spite of several adverse circumstances, something which may be perceived as 
a definition of success. 


Fig. 1 Edouard Lucas (1842-1891) 
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